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© Vaccines and diagnostics derived from bovine diarrhea virus. 

© The nucleotide sequence of the genome for bovine 
diarrhea virus (BDVJ is disclosed. The sequence permits de- 
sign and construction of vaccines effective against BDV. 
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VACCINES AND DIAGNOSTICS DERIVED 
FROM BOVINE DIARRHEA VIRUS 

Technical Field 

This Invention relates to the field of vaccines 
5 and diagnostics for Infectious diseases. Specifically, 
it relates to the disease syndrome caused by bovine 
diarrhea virus, and to vaccines, therapeutics, and 
diagnostics derived from the genomic sequence associated 
with the BDV virus. 

10 Background Art 

Morbidity and mortality caused by bovine 
diarrhea virus (BDV) in dairy and beef herds is a 
worldwide unsolved economic problem. A subclinical form 
characterized by high morbidity and low mortality is 

15 endemic and is associated with diminished respiratory 
capacity, neonatal diarrhea, ulcerations in the 
digestive tract, immunodeficiency, and. in calf bearing 
bovines. abortion and teratogenicity. The disease 1b 
recognizable in calves, but adult carriers are difficult 

20 to identify. 

An acute form of the disease results from 
infection of the fetus in the first trimester of 
pregnancy. The course of this form of the disease is 
insidious. The calves may survive the first infection. 

25 but those that do become immunotolerant. and excrete 
live viruses. They cannot, survive a second infection. 
Since their capacity as carriers cannot be detected by 
titration of their sera, these animals are responsible 
for spreading of the disease from herd to herd. 

30 BDV also infects hog populations. In hogs, it 

is important to distinguish animals'as being infected by 
either BDV or hog cholera virus, since hog cholera is an 
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econoaically important disease . while the bovine 
diarrhea infection is of transient significance, and 
could . for the most part, be ignored. Hogs infected 
with cholera must be slaughtered, and since present 
diagnostic methods in hogs cannot distinguish between 
these two types of infection, hogs which are. in fact, 
only infected with BDV must also be destroyed. 

Present means of detection of BDV infection in 
calves are equally deficient, in that they rely on 
titration for antibodies in sera, which titration will 
fail to detect the immunotolerant calves. Thus, a 
diagnostic method is desired, but presently unavailable, 
which is capable both of detecting the presence of the 
virus in newborn animals with chronic infections, and in 
distinguishing between hog cholera virus and BDV 
infections. This could be accomplished either using 
antibodies with high affinity and specificity for the 
virus particles or using nucleic acid oligomeric probes 
capable of specific hybridization to the viral sequences. 

Similarly, in addition to the need for improved 
diagnostics, there is. at present, no effective vaccine 
which is successful in preventing the~epread~-of the 
disease caused by BDV. It is. of course, desirable that 
such a vaccine would confer long-term immunity, would 
not infect the fetus of the inoculated animal, and would 
have no undesirable side effects such as induction of 
immunotolerance to the virus, or depression of the 
immune system. These characteristics are difficult if 
not impossible to acquire in an attenuated or killed 
virus vaccine. Such vaccines, for the most part, 
constitute the present state of the art (Saurat. P.. et 
al, "La Maladie des Huqueuses N (1972) pp. 229-251. 
L'Expanslon scientific francalse Paris). Recently. 
Fernelius, A. L. . et al. ( Am J Vet Res (1971) 
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32.11963-1979) have reported a vaccine prepared from a 
high molecular weight soluble antigen obtained by 
density gradient centrifugation from BDV virus grown in 
embryonic bovine Kidney cells* 

The approaches used in the art for the 
detection of and protection against bovine viral 
diarrhea have been largely empirical and have not 
utilized refined Knowledge of the nature of the vector 
causing the disease. The bovine diarrhea virus has. 
however, been classified, along with hog cholera and 
border disease viruses as a pestivlrus which is a member 
of the family Tooavlridae (Porter field. J. S., "The 
Togavirions. Biology. Structure* Replication" 
Schlesinger. W.. Ed. (1980). Academic Press, pp. 
17-24). 

By analogy to other togaviruses. these viruses 
should contain a capsid protein and two or three 
membrane glycoproteins (HorzineK, M.C.. Non-arthrpod- 
borne Togaviruses (19B1). Academic Press. London. 
Epitopes which are capable of raising antibodies 
associated with neutralization and protection against 
infection are expected to be contained in the membrane 
proteins (e.g.. see Boere. W. . et al. J Virol (19B4) 
5^:572-582). The pestiviruses are also characterized by 
soluble antigens that are approximately 80 KD proteins. 
A 76 KD protein from BDV has. in fact, been used as an 
experimental vaccine (Fernellus. A.L.. et al. supra). 

Disclosure of the Invention 

The invention provides cDNA copies of the 
entire bovine diarrhea virus RNA genomic sequence. This 
makes available the entire repertoire of peptides 
synthesized by the virus, and maKes possible the 
preparation of proteins which contain epitopes effective 
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and specific in generating desired antibodies and, in 
providing cells suitable for production of monoclonal 
antibodies. The primary structure of the genome alBO 
provides the necessary information to construct 
oligomeric sequences useful as diagnostic probes. 

The protein products are thus able to serve as 
vaccines to protect animals subject to infection by this 
virus from subsequent illness* The accessibility of the 
entire genome provides opportunities Cor production of 
effective proteins, such as major virion components and 
individual virion subunits which would be unavailable 
using -native" production techniques, i.e.. from viral 
infection of tissue cultured cells. 

Accordingly, in one aspect, the invention 
relates to a nucleotide sequence substantially identical 
with that representing the entire genome of BDV as shown 
in Figure 2. Other aspects of the invention concern DNA 
or RN& sequences derived from portions of the genome, 
said sequences not necessarily representing contiguous 
portions. These are useful both as diagnostic probes 
and as coding sequences for desired proteins. 

Other aspects of the invention include 
expression systems for the foregoing DNA derived from 
BDV which are effective in expressing^ this -DNA in 
suitable heterologous hosts, including procaryotes. 
yeast, and mammalian cells. Live viral vectors, such as 
vaccinia , can also be used as carriers, and permit 
expression of the desired antigens along with the 
carriers- proteins in infected cells. Also included in 
the invention are ho&ts transformed with these 
expression systems and the proteins thus produced. The 
proteins produced in this way. or chemically synthesized 
to correspond to the deduced sequence, may be used as 
vaccines either alone, or in conjunction with carrier 
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proteins which enhance their immunogenic ity. In 
addition, the proteins may be used, either alone or 
conjugated with carrier* to elicit production of 
antibodies which are useful in diagnosis of carciers of 
5 the disease or in other immunoassays related to BDV. 

The invention also relates to methods for 
preparing these polypeptide vaccine s an d 
immunoglobulins, and to methods of using the materials 
thus prepared. 

10 Brief Description of the Drawings 

Figure 1 shows the map of overlapping segments 
of cDNA which* together, make up the entice BDV genomic 
sequence and cDNA fragments used to construct E. coll 

expression vectors. 

15 Figure 2 shows the complete nucleotide sequence 

for the BDV genome. The cDNA contains the identical 
sequence, except, of course, that T will be substituted 
for U. The deduced amino acid sequence, based on the 
open reading frame, and confirmed by expression of 

20 segments is also shown. 

Modes of Carrying Out the Invention 
A. Definitions 

As used herein, a nucleotide sequence 
" substantially identical 11 to the exemplified BDV genome 
25 refers to a sequence which retains the essential 
properties of the exemplified polynucleotide. A 
specific, but non-limiting example of such substantial 
equivalence would be represented by a sequence which 
encodes the identical or substantially identical amino 
30 acid sequence, but. which, because of codon degeneracy, 
utilizes different specific codons. Nucleotide changes 
are. indeed, often desirable to create or delete 
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restriction sites, provide processing sites, oc to alter 
the amino acid sequence in ways which do not adversely 
affect functionality. "Nucleotide sequence- refers both 
to a ribonucleotide and a deoxyribonucleotide sequence 
5 and includes the positive sense strand, as shown, and 
the negative sense strand as well. 

A DNA sequence "derived from - the nucleotide 
sequence which comprises the genome of BDV refers to a . 
DNA sequence which is comprised of a region of the 
10 genomic nucleotide sequence, or a combination of regions 
of that sequence. These regions are, of course, not 
necessarily physically derived from the nucleotide 
sequence of the gene, but refer to polynucleotides 
generated in whatever manner which have the same or 
15 "substantially identical" sequence of bases as that in 
the region(s) from which the polynucleotide is derived. 
For example, typical DMA sequences "derived from" the 
BDV genome include fragments encoding specific epitopes, 
fragments encoding portions of the viral polypeptide. 
20 sequences encoding the capsid proteins, sequences 

encoding deleted virions, and sequences encoding other 

useful viral genes. 

"Recombinant host cell6", "host cells", 
"cells", "cell lines", "cell cultures" . and other such 

25 terms denoting microorganisms or higher.eucarxotic cell 
lines cultured as unicellular entities, are used 
interchangeably, and refer to cells which can be. or 
have been, used as recipients for recombinant vector or 
other transfer DNA. and include the progeny of the 

30 original cell tranbf ected. It is understood that the 

progeny of a single parental cell may not necessarily be 
completely identical in morphology or in genomic or 
total DNA complement as the original parent, due to 
accidental or deliberate mutation. Progeny of the 
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parental cell which ace sufficiently similar to the 
parent to be characterized by the relevant property, 
such as the presence of a nucleotide sequence encoding a 
desired peptide, are included in the progeny intended by 
this definition, and are covered by the above terms. 

"Control sequence" refers to DNA sequences 
which are necessary to effect the expression of coding 
sequences to which they are ligated. The nature of such 
control sequences differs depending on the host 
organism; in procaryotes, generally such control 
sequences include a regulatory region promoter and 
ribosome binding site and termination signals; in 
eucaryotes, generally, such control sequences include 
promoters* terminators, and, in some instances, 
transcriptional enhancers. The term "control sequences" 
is intended to include, at a minimum, all components 
whose presence is necessary for expression, and may also 
include additional components whose presence is 
advantageous. 

"Operably linked" refers to a juxtaposition 
wherein the components so described are in a 
relationship permitting them to function in their 
intended manner. A control sequence "operably linked" 
to a coding sequence is ligated in such a way that 
expression of the coding sequence is achieved under 
conditions compatible with the control sequences. 

B. General Description 

At the center of the present invention is the 
provision of a nucleotide sequence containing the entire 
genome of bovine diarrhea virus. The availability of 
this complete polynucleotide permits the design and 
production of oligomeric probes for diagnosis, of 
vaccines effective against BDV, and of proteins useful 
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information available from the genome allows the amino 
acid sequence of the polypeptide to be deduced, and 
locations of favorable epitopes surmised. Further, once 
the desired sequences are chosen, appropriate fragments 
of the genome can be obtained and expressed 
independently, thus providing desired polypeptides. 
Short polypeptide fragments may also be chemically 
synthesized and linked to carrier proteins for use as 
immunogens. Recombinantly expressed polypeptides may be 
provided under conditions offering a favorable 
environment for processing into, for example, 
conjugation with cellular or artificial membranes which 
could thus bear the epi topic sites without the 
disadvantages of using an infectious virus. Mammalian 
and yeast cells provide suitable environments for such 
expression. In addition, the epitopes may be produced 
linked to a particle forming protein. 

The above proteins produced may. themselves be 
used as vaccines, or may be used to induce 
immunocompetent B cells in hosts, which B cells can then 
be used to produce hybridomas that secrete antibodies 
useful in passive immunotherapy and diagnosis. 

B.l. Nucleotide Sequence of tb «> BDV Genome 

The genomic sequence of BDV was obtained from 
cDNA clones representing oveclapping sections of the 
entire viral RNA genome (Figure 1) . The viral RNA was 
isolated from virus grown on bovine embryonic Kidney 
cells. The viral RNA was fractionated on sucrose 
gradients, and those fractions containing RNA of 
sufficient length to contain the intact genome were 
pooled, ethanol precipitated, and used to prepare a cDNA 
library. cDNA Inserts were screened initially using a 
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(*/-) system. Positive hybridizations were against RNA 
isolated from virus after lysis of infected cells, 
negative hybridizations were against RNA isolated from 
uninfected cells. One insert having the proper +/- 
5 response was then used as a reference clone to map the 
remainder of the library. Several colonies hybridizing 
to the positive insert were used to obtain additional 
portions of the viral genome from the cDNA library using 
"walking" techniques. Ten cDNA clones were obtained 
10 representing overlapping portions of the viral genome, 
as shown in Figure 1. and were subjected to restriction 
napping and sequencing. The entire genomic sequence was 
deduced from these ten cDNA inserts, and is shown in 
Figure 2. 

15 The illustrated DNA sequence and portions 

thereof are useful directly as diagnostic tools for 
detecting the presence of BDV in infected animals. 
These are particularly useful in distinguishing BDV 
infections from hog cholera virus. Methods to employ 

20 DNA hybridization in diagnosing disease have been 

disclosed in U.S. Patent No. 4.358,535 to FalKow. As 
set forth therein, biological samples may be used 
directly in obtaining Southern blots using suitable 
probes. Since the BDV genome is different from that of 

25 hog cholera virus , specific portions of the BDV sequence 
may be used to detect the presence of corresponding 
complementary sequences in biological samples from 

subjects suspected of harboring the Infection. 

* 

B.2. Preparation of viral Polypeptide Fragments In 
30 E. coll 

The availability of the entire genomic sequence 
permits construction of expression vectors encoding 
presumptively antigenically active regions of the virion 
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proteins. Fragments encoding the desired proteins are 
obtained from the cDNA clones using conventional 
restriction digestion and ligated into a series of 
vectors containing polyiinker sites in all possible 
reading frames to generate fusion proteins at the 
C-terminal end of B-galactoeidase. Eleven portions of 
the BDV genome were expressed as fl-gal fusions in Sa. 
coii using this approach, as outlined in Figure l. 
These portions were obtained by restriction cleavage 
and/or ligation of the ten original clones, or the 
original cloned sequences were used directly. The 
fusion proteins thus produced may be immunogenic. 

B.3. Pr oration antigenic Polypeptides and, 
pnnj uoatiop "*th Carrier 

peptide regions representing epitopes can be 
synthesized using chemical or recombinant methods, and 
provided with, for exampl*. cysteine residues at the C- 
ter*inus which provide means for linking the peptides to 
neutral carrier proteins. A number of techniques for 
obtaining such linkage are known in the art. including 
the formation of disulfide linkages using common 
reagents such as N-8Uccinimidyl-3-(2-pyridyldi- 
thio)propionate (SPDP) and succinimidyl-4-(N- 
naleimido-methyl)cyclohexane-l-carboxylate (SMCC) 

obtained from Pierce Company. Rockfar*. Illinois. These 
reagents create a disulfide linkage between themselves 
and peptide cysteine residues in one protein and an 
amide linkage through the c-amino on a lysine, or 
other free amino group in the other. A variety of such 
disulfide/amide-forming agents are known. See. for 
example. i-mn Rev (1982) 62:185. Other Afunctional 
coupling agents form a thioether rather than a disulfide 
linkage. Many of these thioether-forming agents are 
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commecc tally available and include reactive esters of 
6-maleimidocaproic acid, 2-brorooacetic acid, 
2-iodoacetic acid. 4-(N-maleimido-methyl) 
cyclohexane-l-carboxylic acid, and the like. The 
carboxyl groups can be activated by combining them with 
succinimide or l-hydroxy-2-nitro-4-sulf onic acid, sodium 
salt. The foregoing list is not meant to be exhaustive, 
and modifications of the named compounds can clearly be 
used. 

Any carrier may be used which does not itself 
induce the production of antibodies harmful to the 

* ■ 

subject, such as the various serum albumins, tetanus 
toxoids, or keyhole limpet hemocyanin (KLH) . 

The conjugates, when injected into suitable 
subjects, result in the production of antisera which 
contain immunoglobulins specifically reactive against 
not only these conjugates, but also against fusion 
proteins carrying the analogous portions of the 
sequence, and against whole BDV. 

B.4. Preparation of Mammalian Cell Membranes 
Containing BDV Epitopes 

Portions of the cDNA library comprising the BDV 
genome were also ligated into .expression vectors 
compatible with mammalian recombinant host cells; in the 
illustration below, into a mammalian/bacterial shuttle 
vector containing a linker sequence downstream of the 
SV40 early promoter, which, is followed by the poly A 
sequence also derived from SV40. Alternate vectors to 
this particular host vector. pSV7d. could, of course, 
also be used. The mammalian-compatible vectors 
containing the coding sequences for the desired 
polypeptides are then transformed into suitable 
mammalian cells for expression of the sequences and. in 



0208672 



-12- 
th e case of surface glycoproteins, transport of the 
produced protein to the membrane. The cells are 
ultimately harvested and used as whole cells in the 
formulation of vaccines, or the membranes are disrupted 
and portions of the membranes used correspondingly, or 
the proteins purified and formulated into vaccines. 

B.5. Preparation of Hybrid Particle Immunoqens 
Containing BDV Epitopes 

The immunogenicity of the epitopes of BDV may 
also be enhanced by preparing them in mammalian or yeast 
systems fused with particle-forming proteins such as 
that associated with hepatitis B virus (HBV) surface 
antigen (HBsAg). Constructs wherein a BDV epitope is 
linked directly to the particle-forming protein coding 
sequences produce hybrids which are immunogenic with 
respect to the BDV epitope, as well as to HBV epitopes. 

Hepatitis B surface antigen has been shown to 
be formed and assembled in S. cerevislae (Valenzuela et 
al. Nature (1962) 298 :344-350. The formation of such 
particles has been shown to enhance the immunogenicity 
of the monomer subunit. The particles can also be 
formed from constructs which contain the presurface 
(pre-S) region in addition to the mature surface 
antigen. The pre-S region encodes an immunodominant HBV 
epitope and these proteins are expressed in yeast 
(Neurath et al. Science (1984) 224:392-394). Expression 
of constructs encoding pre-S region fused to particle 
forming protein are disclosed in U.S. Serial No. 
621,756, filed 18 June 1984. Expression of coding 
sequences for hybrid particles containing HBsAg and a 
heterologous epitope are disclosed in U.S. Serial No. 
650,323, filed 13 September 1984. The foregoing 
applications are assigned to the herein assignee and 
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incorpocated by reference. These constructs may also be 
expressed in mammalian cells such as Chinese hamster 
ovary cells using an SV40-dihydrof olate reductase vector 
(Michelle et al. Int Svmp on Vital Hepatitis (1984)). • 

In addition, portions of the particle-forming 
protein coding sequence per se may be replaced with 
codons for an BDV epitope. In this replacement, regions 
whictr are not required to mediate the aggregation of 
units to form immunogenic particles in yeast or mammals 
can be deleted, thus eliminating additional hepatitis B 
antigenic sites from competition with the BDV epitope. 

B.6. Vaccinia Carrier 

Large, wide host range virus carriers have also 
been used in formulating vaccines by integrating the 
epitopic regions of the desired immunogen into the 
carrier viral genome. Vaccinia virus, in particular, 
has been used for this purpose. For example. Smith. 
G.L.. et al. Proc Natl Acad Set (USA) (1983) 
80:7155-7159. disclose the integration of the 
hemagglutinin gene from influenza virus into the 
vaccinia genome and use of the resulting recombinant 
virus as a vaccine, similarly. Panicali. D. . et al. 
. ibid (1982) 79.:4927-4931. cloned the thymidine Kinase 
gene from Heroes simplex virus Into vaccinia. The 
availability of the BDV genome of the invention offers 
similar opportunities. The recombination is generally 
done by co-infecting cells* both with vaccinia virus and 
with a chimeric plasmid carrying the desired coding 
sequence under the control of the transcriptional 
regulatory signals and RNA start site from the vaccinia 
virus gene adjacent to a translational start 
site/foreign protein coding sequence. During infection 
the similarity in the flanking DNA sequences of the 
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foreign DNA sequences to those in vaccinia causes 
integration of the desired portion of the chimeric 
plasmid into the vaccinia genome. The resulting 
recombinant vaccinia can be harvested from the infected 
5 cells and used in the formulation of a vaccine. 
Vaccinia virus has an extremely large (120 x 10 6 
dalton) genome, and may be very easily grown in 
culturtf. Hence* the production of large amounts of 
inexpensive immunogenic vaccine is readily possible. 



10 B.7. Preparation of Vaccines 

Preparation of vaccines which contain peptide 
sequences as active ingredients is also veil understood 

* 

in the art. Typically, such vaccines are prepared as 
injectables, either as liquid solutions or suspensions; 

15 solid forms suitable for solution in, or suspension in, 
liquid prior to injection may also be prepared. The 
preparation may also be emulsified or the protein 
encapsulated in liposomes. The active immunogenic 
ingredient is often mixed with excipients which are 

20 pharmaceutical^ acceptable and compatible with the 
active ingredient. Suitable excipients are, for 
example, water, saline, dextrose, glycerol, ethanol. or 
the like and combinations thereof. In addition, if 
desired, the vaccine may contain minor amounts of 

25 auxiliary substances such as wetting or emulsifying 

agents, pH buffering agents, or adjuvants which enhance 
the effectiveness of the vaccine. The vaccines are 
conventionally administered parenterally, by injection, 
for example, either subcutaneous ly or intramuscularly. 

30 Additional formulations which are suitable for other 
modes of administration include suppositories and, in 
some cases, oral formulations. For suppositories, 
traditional binders and carriers may include, for 
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example, polyalkaline glycols or triglycerides; such 
suppositories may be formed from mixtures containing the 
active ingredient in the range of 0.5* to 10\, 
preferably l\-2\. Oral formulations include such 
5 normally employed excipitents as, for example, 
pharmaceutical grades of manitol. lactose, starch 
magnesium stearate. sodium saccharine, cellulose, 
magnesium carbonate and the like. These compositions 
take the form of solutions, suspensions, tablets, pills. 

10 capsules, sustained release formulations or powders and 
contain 10\-951 of active ingredient, preferably 25\-70\. 

The proteins may be formulated into the vaccine 
as neutral or salt forms. Pharmaceutically acceptable 
salts, include the acid addition salts (formed with the 

15 free amino groups of the peptide) and which are formed 
with inorganic acids such as. for example, hydrochloric 
or phosphoric acids, or such organic acids as acetic, 
oxalic, tartaric, mandellc. and the like. Salts formed 
with the free carboxyl groups may also be derived from 

20 inorganic bases such as. for example, sodium, potassium, 
ammonium, calcium, or ferric hydroxides, and such 
organic bases as isopropylamine. t r ime thy 1 amine. 
2-ethylamino ethanol. histidine. procaine, and the. like. 

■ 

The vaccines are administered in a manner 
25 compatible with the dosage formulation., and in such 
amount as will be therapeutically effective and 
immunogenic. The quantity to be administered depends on 
the subject to be treated,. capacity of the subject's 
immune system to synthesize antibodies. ' and the degree 
30 of protection desired. Precise amounts of active 

ingredient required to be administered depend on the 
judgment of the practitltioner and are peculiar to each 
subject. 
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B. 8 . Preparation of Mabs Against BDV Epitopes 

The Immunogenic proteins oc immunocon jugates 
prepared as described above may be used to obtain 
peripheral blood lymphocytes and spleen cells in 
injected mammals to prepare hybridomas capable of 
secreting monoclonal antibodies directed against these 
epitopes. The resulting monoclonal antibodies are 
particularly useful in diagnosis, and, those which are 
neutralizing are useful in passive immunotherapy. 

C. General Methods 

The general techniques used in extracting RNA 
from the virus, preparing and probing a cDNA library, 
sequencing clones* constructing expression vectors, 
transforming cells, and the like are known in the art 
and laboratory manuals are available describing these 
techniques. However, as a general guide, the following 
sets forth some sources currently available for such 
procedures, and for materials useful in carrying them 
out. 

C.l. Hosts and Expression Con trol Sequences 

Both procaryotic and eucaryotic host cells may 
be used for expression of desired coding sequences when 
appropriate control sequences are used compatible with 
the designated host. Procaryotes are more useful for 
cloning: either procaryotes or eucaryotes may be used 
for expression. Among procaryotic hosts. E. coll Is 
most frequently used, mostly for convenience. 
Expression control sequences for procaryotes include 
promoters, optionally containing operator portions, and 
ribosome binding sites. Transfer vectors compatible 
with procaryotic hosts are commonly derived from, for 
example. pBR322. a plasmld containing operons conferring 
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ampicillin and tetracycline resistance, and the various 
pUC vectors , which also contain sequences conferring 
antibiotic resistance. The foregoing operons may be 
used as markers to obtain successful transf ormants by 
5 selection. Commonly used procaryotic control sequences 
include the fl lactamase (penicillinase) and lactose 
• promoter systems (Chang, et al. Nature (1977) 198:1056. 
the tryptophan (trp) promoter system (Goeddel. et al. 
Nucleic Acids Res (1980) 8:4057) and the X derived 
10 promoter and N gene ribosome binding site 

(Shimatake. et al # Nature (1981) 292:128). The 
foregoing systems are particularly compatible with IL. 
coli : if desired other procaryotic hosts such as strains 
of Bacillus or Pseudomonas may be used, with 
15 corresponding control sequences. 

Eucaryotic hosts include yeast and mammalian 
cell culture. Saccharomvces cerevlsciae . or Baker* s 
yeast and Saccharomvces carlsberoensls are the most 
commonly used yeast hosts, again because of 
20 convenience. Yeast compatible vectors carry markers 
which permit selection of successful transf ormants by 
conferring prototrophy to auxotrophic mutants or by 
conferring antibiotic resistance or resistance to heavy 
metals on wild-type strains. Yeast compatible vectors 
25 may employ the 2 micron origin of replication (Broach. 
j. ( et al, Meth Enz (1983) 101-307) the combination of 
CEN3 and ARS1. or other means for assuring replication, 
such as sequences which will result in incorporation of 
the appropriate fragment into the host cell genome. 
30 Control sequences for yeast vectors include promoters 

for the synthesis for glycolytic enzymes (Hess, et al. J 
Adv Enzvme Reg (1968) 7:149. Holland, et al. 
Biochemistry (1978) 17:4900). and the promoter for 3 
phosphoglycerate kinase (Hitzeman. et al. J Biol Cfrem 
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(1980) 255 :20731 . For yeast expression, terminators nay 
also be included, such as those derived from the enolase 
gene (Holland. M. J.. J Biol Chem (1981) 256:1385). 
Particularly useful control systems include those 
specifically described herein, which comprise the 
glyceraldehyde-3 phosphate dehydrogenase (GAPDH) 
promoter or alcohol dehydrogenase (ADH) regulatable 
promoter, terminators also derived from GAPDH. and, if 
secretion is desired, leader sequence from yeast alpha 
factor* These systems are described in detail in U.S. 
Serial Nos. 468,589 and 522,909. filed 22 August 1983 
and 12 August 1983, respectively, assigned to the same 
assignee, and incorporated herein by reference* 

Mammalian cell lines available as hosts for 
expression include many immortalized cell lines 
available from the American Type Culture Collection, 
including HeLa cells, Chinese hamster ovary (CHO) cells, 
baby hamster kidney (BHK) cells, and a number of other 
cell lines. Suitable promoters for mammalian cells 
prominently include viral promoters such as that from 
Simian virus 40 (SV40) (Piers, et al. Nature (1978) 
273 :113) or other viral promoters such as the Rous 
sarcoma virus (RSV) adenovirus, and bovine paplloma 
virus (BPV). Mammalian cells may also require 
terminator sequences. Vectors suitable foe replication 
in mammalian cells may include viral replicons. or 
sequences which insure integration of the appropriate 
sequences into the host genome. 

C.2. Transformations 

The transformation procedure used depends on 
the host to be transformed. Bacterial transformation 
generally employs treatment with calcium or rubidium 
chloride (Cohen. S. N. • Proc Natl Acad Sci (USA) (1972) 
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69.:2110, Maniatis. et al. Molecular Cloning: A 
Laboratory Manual (19B2) Cold Spring Harbor Press, p. 
254). Yeast transformations may be carried out using 
the method of Hinnen. A., et al, Proc Natl Read Sci 
5 (USA) (1978) 75:1929-1933. Mammalian transformations 
are conducted using the calcium phosphate precipitation 
method of Graham and van der Eb, Virology (1978) 52:546, 
oc the various modifications thereof. 

■ * ■ 

C.3. Vector Construction 
10 Vector construction employs techniques which 

are by now quite well understood. Site-specific DNA 
cleavage is performed by treating with suitable 
restriction enzyme under conditions which generally are 
specified by the manufacturer of these commercially 
15 available enzymes (see, e.g.. The New England Biolabs 
Product Catalog). In general, about 1 vg of plasmid 
oc DNA sequence is cleaved by 1 unit enzyme in about 20 
111 buffer solution for an incubation time of about 1-2 
hr at about 37°C. After incubation with the restriction 
20 enzyme, protein is removed by phenol/chloroform 

extraction and the DNA recovered by reprecipitation with 
ethanol. The cleaved fragments may be separated using 
polyacrylamide or agarose gel electrophoresis 
techniques, according to the general procedures found in 
25 Methods in Enzvmology (1980) 65:499-560. 

Sticky ended cleavage fragments may be blunt 
ended using E. coli DNA polymerase I (Klenow) in the 
presence of the appropriate deoxynucleotide 
triphosphates (dNTPs) using incubation conditions 
30 appropriate to the polymerase. The polymerase digests 
protruding 3 1 single strands, but fills in 5' protruding 
ends, according to the dNTPs present in the mixture. 
Treatment with SI nuclease may also be used, as this 
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results In hydrolysis of any single stranded DNA 
portion. 

Ligations are carried out using standard buffer 
and temperature conditions using T4 DNA ligase. and ATP: 
5 sticky end ligations require less ATP and less ligase 
than blunt end ligations. When vector fragments are 
used as part of a ligation mixture, the vector fragment 
is often treated with bacterial alkaline phosphatase 
(BAP) in order to remove the 5' phosphate and thus 
10 prevent religation of the vector: alternatively. 

restriction enzyme digestion of unwanted fragments can 
be used to prevent religation. 

Ligation mixtures are transformed into suitable 
cloning hosts, such as k. coli . and successful 
15 transformants selected by. for example, antibiotic 

resistance, and screened tor the correct construction. 

C.4. construction of Den ied DNA Sequences 

Synthetic oligonucleotides may be prepared 
using an automated oligonucleotide synthesizer as 
20 described by Warner. B. D. . et al. DNA. (1984) 

3:401-411. If desired, these synthetic strands may be 
kinased tor labeling with 32 P by using an excess of 
polynucleotide kinase in the presence of labeled ATP. 
under standard kinasing conditions.-_ 
25 DNA sequences including those isolated from 

genomic or cDNA libraries may be modified by site 
directed mutagenesis, as described by Zoller. M. et al. 
H .,HM g Acids Res (1982) lfi:"87-6499. Briefly, the DNA 
to be modified is packaged into phage as a single 
30 stranded sequence, and converted to a double stranded 
DNA with DNA polymerase using, as a primer, a synthetic 
oligonucleotide complementary to the portion of the DNA 
to be modified, and having the- desired modification 
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included in its own sequence. The resulting double 
stranded DNA is transformed into a phage supporting host 
bacterium, and cultures of the transformed bacteria, 
which will contain replications of each 6trand of the 
5 phage, are plated in agar to obtain plaques. 

Theoretically SO* of the new plaques will contain phage 
having as a single strand the mutated form; 50% will 
have the original sequence. Replicates of the plaques 
are hybridized to Kinased synthetic probe at 
10 temperatures and conditions which permit hybridization 
with the correct strand, but not with the unmodified 
sequence. The thus identified, desired, modified 
sequences are then recovered and cloned to serve as 
sources foe the desired DNA. 

15 C.5. Hybridization with Probe 

DNA libraries are probed using the procedure of 
Grunstein and Hogness ( Proc Natl Acad Sci (USA) (1975) 
7_3:3961). Briefly, in this procedure, the DNA to be 
probed is immobilized on nitrocellulose filters. 

20 denatured, and prehybridlzed with a buffer containing 

0-50\ formamide, 0.6 M NaCl. 60 mH sodium citrate, 0.02% 
(wt/v) each of bovine serum albumin, polyvinyl 
pyrollidine, and Flcoll, 50 mM sodium, phosphate (pH 

■ 

6.5), 1% glycine, and 100 ug/ml carrier denatured 
25 DNA. The percentage of formamide in the buffer, as well 
as the time, and temperature conditions of the 
pcehybrldization and subsequent hybridization steps 
depends on the stringency desired. Oligomeric probes 
which require lower stringency conditions are generally 
30 used with low percentages of formamide, lower 

temperatures, and longer hybridization times. Probes 
containing more than 30 or 40 nucleotides such as those 
derived from cDNA or genomic sequences generally employ 

♦ 
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higher temperatures, e.g. about 40-42* and a high 
percentage, e.g. 50\ formamide. Following 
prehybridizatlon. this same buffer, now containing the 
32 P kinased oligonucleotide probe, is added to obtain 
hybridization. Radioautography of the treated filters 
shows the location of the hybridized probe, and the 
corresponding locations on replica filters which have 
not been probed can then be used as the source of the 
desired DNA. 

C.6. Verification of Construction and Sequencing 

For routine vector constructions • ligation 
mixtures are transformed into E. coli strain HB101 or 
other suitable host, and successful transf ormants 
selected by antibiotic resistance or other markers. 
Plasmids from the transf ormants are then prepared 
according to the method of Clewell. D. B. • et al. Proc 
Hatl Acad Scl (USA) (1969) 62:1159. usually following 
chloramphenicol amplification (Clewell. D. B*. J 
Bacteriol (1972) 110:667). The isolated DNA is isolated 
and analyzed by restriction analysis, or sequenced by 
the dideoxy method of Sanger. F. , et al. Proc Natl Acad 
Sci (USA) (1977) 74:5463. aB further described by 
Messing, et al. Nucleic Acids Res (1981) 9:309. or by 
the method of Maxam. et al. M^thods-ln Enzymology (1980) 
65:499. 

D. Examples 

The following examples are intended to 
illustrate but not limit the invention. The procedures 
set forth, for example, in Is D.l and D.2 may. if 
desired, be repeated but need not be. as techniques are 
available for construction of the desired nucleotide 
sequences based on the information provided by the 
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invention. Expression is exemplified in E. coll and in 
yeast, however other systems are available as set forth 
mote fully in ic.l. Additional epitopes derived from 
the genomic structure may also be produced* and used to 
generate antibodies as set forth below. 

D.l. Preparation of cDNA 

D.l.a. Production of BVD Virus 
Bovine Embryonic Kidney cells (BEKI) cells were 
grown in MEM (Earl's) containing 0.B5 g/1 NaHC0 3 and 
10\ of irradiated fetal calf serum. The biologically 
cloned Osloss strain of BVD virus was passaged S times 
through BEKI cells at a multiplicity of 0.1. Cytopathic 
effects, consisting of clustering of cells followed by 
vacuolation and then cell lysis, were readily observable 
from the first passage. Final titers (~ 10 pfu/ml) 
were obtained after recovery of virus by freezing and 
thawing of infected cells. 

Foe the virus production, 175 cm plastic 
flasks of subconfluent BEKI cells were used. The cells 
were washed 3 times with infection buffer (MEH (Earl's) 
+ 2.2 g/1 NaHC0 3 . pH 7.6) and then were infected with 
2 ml of BVD in infection buffer at a multiplicity of 
0.05 pfu/cell. After 1 hr at 35°C. IB ml of infection 
buffer was added and the cells were incubated for 4-5 
days at 35 W C, after which cytopathic effect (vacuolation 
followed by cells lysis) was greater than B0\. In a 
typical production, 150 flasks of cells were infected. 
The medium (about 3 liters) was collected and stored at 
4°C. The remaining cells were scraped in 2 ml of 
infection buf f er/f lasK. subjected to 3 cycles of 
freezing and thawing, and the final suspension was added 
to the infection medium. After a centrifugation at 
10,000 g for 30 min, the supernatant was concentrated 
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10-fold by ultracentrifugation at 120,000 g for 4 hrs 

and 40 min at 4°C. 

Infectious virus had a density of 1.12 g/ml as 
measured by isopicnic banding in sucrose density 
gradient, and appeared as 45-55 nm spherical particles 
by electron microscopy. The virus preparations were 
neutralized by anti-BVD antiserum from rabbits injected 
with virus or from bo vines. 

D.l.b. Extraction and Puri fication nf viral RNA 
RNA was isolated from the virus pellet by the 
CsCl/guanidinium thiocyanate method as described by 
Chirgwin. et al. Biochemistry (1979) 1S/-3M4. and the 
purified RNA stored in 70\ ethanol at -20*C. This RNA 
preparation contained a large amount of contaminating 
low molecular weight cellular RNA and intact viral RNA, 
Viral RNA was further purified by sucrose density 
gradient centr ifugation as follows: 

An aliquot containing an estimated amount -of 5 
V g of BVD-RNA was centr if uged at 10.000 g for 15 min 
at 4*C. The pellet was washed with SO* ethanol. 
denatured in 375 *1 of 99\ DMSO (99\). 5 mM Tris-HCl 
(pH 7.5) and incubated for 5 min at 37-C. After 
addition of 1.125 ml of 5 mM Trie HCl_(pH 7. _5). ImM 
EDTA. 1\ Sarkosyl. the solution was heated for 2 min at 
70-C and quenched on ice. This solution was distributed 
on 5x15-30% sucrose gradients in 5 mM Tris HCl (pH 7.5). 
10 tfM EDTA. 0.1M NaCl. 1% Sarkosyl (in sterile 
siliconized Beckman SW40 tubes). A sixth gradient was 
loaded with 3* end labeled RNA as a marker (see below). 
After a centr if ugation for 16 hrs at 19.000 rpm (20°C). 
the gradients were fractionated (1 ml fractions). The 
RNA from each fraction of the gradient corresponding to 
that containing marker-labeled RNA was precipitated with 
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2.5 volumes of ethanol in the presence of carrier yeast 
RNA (10 ug) and subjected to formaldehyde agarose gel 
electrophoresis, Lehrach, et al. Biochemistry (1977) 
16.: 4743, to determine which fraction contained the 
BDV-RNA band. Fractions corresponding to those 
containing the BDV-RNA, were pooled from the parallel 
gradients and precipitated with 2.5 volumes of ethanol, 
washed wijth 60% ethanol and stored at -20°C in 70* 
ethanol. 

The purified viral RNA was labeled with 

32 

P-pCp (3000 Ci/n mol) according to England, et al, 

Meth Bnzvmol (1980) 65:65-74, and analyzed by agarose 

gel electrophoresis in the presence of 2.2 H 

formaldehyde as described in Lehrach. et al, (supra). 

3 

Fluorography was done with H-Enhancer (NEN) as 
recommended by the manufacturer. 

The majority of the radioactivity was 
associated with low molecular weight RNA (less than 2 
kb), but a small proportion was found in a high 
molecular band approximately 12.5 Kb, identified as RNA 
by labeling properties with RNA ligase, its sensitivity 
to RNAse and alkali, and resistance to DNAse and 
proteinase K. In agreement with other reports on 
toga viruses of the flavivirus group, the BDV-RNA did not 
bind to oligo dT cellulose, showing either the absence 
of a poly A stretch at the 3 1 end. or that, if present, 
the poly A is extremely short. Control Slndbis virus RNA 
was properly retained by the same column. 

These properties of the 12.5 kb band were 
identical with those shown by RNA extracted from BEKI 
cells, grown as follows: 

2 

BEKI cells were grown in 25 cm plastic 
flasks, washed 3 times with infection buffer, and 
infected at multiplicities of 50-100 pfu/cell with 1 ml 
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of BDV solution. After one hour at 35°C, 4 ml of 
infection buffec was added and the incubation was 
continued. After 12, IS, 18, 21 and 36 hra (36 he 
corresponds to a complete cycle of BDV replication), the 
5 newly synthesized RNA was labeled with H- uridine (100 
uCi/dish). Uninfected cellular RNA harvested after 18 
hrs of incubation was also analyzed. After 30 min of 
labeling, the cellular RNA was extracted using the 
CsCl/guanidinium thlocyanate method of Chirgwin et al, 

10 1979 (supra). The pellet of RNA, obtained after 

ultracentrifugatlon through a 5.7 H CsCl cushion, was 
directly analyzed by formaldehyde agarose gel 
electrophoresis and gel was dried and f luorographed. In 
all the incubation times tested, a 12.5 kb band which is 

15 absent in the uninfected cells could be detected which 
has the same physico-chemical properties as shown by the 
RNA above. 

D.l.c. Preparation of cDNA 

The viral RNA isolated from the virus in 

20 ID.l.b. was polyadenylated using the method of Sippel, 
Eur J Biochem (1973), 32:31-40. Briefly, the estimated 
amount of 0.7 ug of purified BVD RNA was incubated in 
5 ml of 5 AH methylmercury hydroxide for 10 min at room 
temperature and Incubated for 6 min at 37*C with 20 

25 units of poly A polymerase (BRL) and 500 yCi of 

3 H-ATP (36 Ci/mmol, Amersham) in 50 vl of 50 mM HC1 
(pH 7.5). 10 mM MgCl 2# 2.5. mM MnCl 2 . 0.3 M NaCl. 1.5 
DM 2-mercaptoethanol and containing 2.5 vg of 
RNAse-f ree BSA and 5 units of human placental 

30 ribonuclease inhibitor (BRL). After phenol/chloroform 
extraction, the RNA was purified by chromatography on 
Sephadex G50 and precipitated with 2.5 volumes of 

9 
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ethanol. The polyA RNA was used to prepare probes and 
as a template for the cDNA library. 

To make probes 1 u9 of the polyA RNA was 
incubated foe 10 mln at room temperature in 5 vl of 10 
5 DM methylmercury hydroxide and then 45 min at 37°C with 
40 units of reverse transcriptase in 100 ml of 50 mM 

Tris HC1 (pH 8.3), 10 mM MgCl 2# 1.5 mM 

2-mercaptoethanol # 1 mM dATP. dGTP and dTTP. 10 yM 

dCTP, 0.2 mg/ml of actinomycin D, 5 units of human 

10 placental ribonuclease inhibitor. 500 vCi of alpha 

32 

P-dCTP (3000 Ci/mmole, Amersham) and 20 U9 of 
oligonucleotides obtained by partial digestion with 
DNAse I of calf thymus DNA (random primers). After 15 
and 30 min. ten more units of reverse transcriptase were 

15 added. After phenol/chloroform extration and Sephadex 
G50 column chromatography the RNA was hydrolyzed with 
0.1 M NaOH (1 hr at 65°C) thus yielding single stranded 
cDNA strands. The solution was neutralized with 0.1 M 
acetic acid and added directly to the hybridization 

20 buffer. 

For the cDNA library two separate cloning 
protocols involving dT (12-18) primers or random (calf 
thymus), DNA-derived oligonucleotide primers were used. 
RNA polyadenylated in vitro as described above was 

25 used. Approximately 1 ug polyadenylated RNA was 

incubated with 10 mM methylmercury hydroxide in a 10 
ill volume for 10 min at room temperature, and excess 
reagent was titrated by adding 1 ]il of a 3M 
2-mercaptoethanol solution. This denatured polyA RNA 

30 was used immediately in the presence of 50 nM Tris pH 
8.0. 1 mM dATP. dGTP. dCTP and dTTP. 2.5 ug/ml dT12-18 
or the calf thymus* random oligonucleotide primers. 10 mM 
MgCl 2 « 10 iig/ml actinomycin D. 100 units of RNAse 
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inhibitor. (BRL) and 60 unite of reverse transcriptase in 
a total volume of 100 ui- 

The samples were diluted to 400 ul with a 
buffer containing 10 mM Trie pH 7.0. 100 mM NaCl. 10 mM 
EDTA and 0.2\ SDS extracted with phenol/chloroform, 
freed of dNTPs by Sephadex G50 chromatography, and 
ethanol precipitated. 

The precipitated mixture of RNA and cDNA 
hybrids (10 ul) were diluted into 50 ml of SI buffer 
(500 mM NaCl, 50 mM Na acetate pH 4.5 and 1 mM ZnCl 2 
and digested for 15 min at room temperature with 20 
units of SI nuclease. The reaction was stopped by 
diluting the sample to 500 ml with a buffer containing 
50 mM NaCl, 10 mM EDTA and 50 mM Tris pH 7.0. and 
digestion was continued for 15 min at room temperature 
by adding 20 ug/ml of RNAse A. After phenol and 
chloroform extraction, the RNArcDNA hybrids were 
concentrated by ethanol precipitation and fractionated 
on a Sepharose CL4B column prepared in a 1 ml plastic 
pipette. The excluded peak, containing molecules larger 
than BOO base-pairs, was pooled and ethanol precipitated 
to give 50 ng of hybrid for the dT primed, and 200 ng of 
hybrid for the random calf thymus fragment primed 
reactions. 

Both samples were tailed for dC residues under 
conditions yielding 15-25 residues per DNA or RNA 
termini, and annealed to a dG tailed pBR322 vector 
linearized at the Pstl site (NEN) at a vector 
concentration of 0.1 vq/ml. The annealed plasmids 
were transformed into E, coli HB101 to Amp to obtain 
the cDNA library. 
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D.2. Screening of the cDNA Library 

Screening employed a +/- method using labeled 
cDNAs prepared from RNA isolated from uninfected BEKI 
cells (-probe) and from RNA isolated from the virus 
obtained after complete lysis of the cells (+ probe). 
Colonies of the E. coli harbored cDNA library were 
grown, lysed on nitrocellulose filters (two replicas) 
and probed. The hybridization buffer used for + probe 
also contained an excess of cellular RNA isolated from 
uninfected BEKI cells (10 mg/ml). The colonies which 
gave a clear signal with the + probe and no response 
with the - probe were selected. By this method, 95 
oligo dT- primed and 18S random primer primed clones were 
selected. The length of the inserts after Pstl 
digestion varied from 400 to 4.000 base pairs. No 
full-length virus specific cDNA was obtained. 

One of the clones, pDT2B, with a 880 bp insert 
was selected for further analysis. This fragment from a 
Pstl -digest of plasmid DNA was purified by acrylamide 
gel electrophoresis, digested with Ddel and Mb o I and 
then labeled with the Klenow fragment of DNA polymerase 
I and the four 32 P dNTPs to yield 10 6 -10 4 cpm/mg 
of insert. Labelled Insert was verified by 
hybridization to viral RNA fractionated on a 0.9\ 
agarose gel electrophoresis in presence of formaldehyde 
(Smiley, et al. Anal Blochem (1983) 131:365-372). 
Stringent hybridization conditions were used: 
prehybridizations and hybridizations were overnight at 
42*C, and 50% formamide was used in hybridizations. 
Washing was at 65°C first with 2xSSC, 0.1\ SDS and then 
with 0.2XSSC and 0.1\ SDS • 

In the foregoing verification. RNA from 
uninfected cells was used as negative control. The 
absence of exogenous viral sequences in the genome of 
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the cells was verified by failure of cellular DNA 
digested with BamHI and EcoRI to bind to pDT28 pcobe in 
Southern blot analysis. The RNA from infected cells 
after 24 hrs of infection at a multiplicity greater than 

3 l, and from the pellet of virus after complete cell 
lysis were used as positives. No hybridization was 
detected with the RNA from the uninfected cells, but the 
inserts hybridized to an approximately 13 kb band of the 
RNA isolated from the infected cells or from the pellet 

10 of virus. 

The plasmid pDT2B. which had been verified to 
contain a Pstl insert which binds to the viral RNA. was 
used to probe the cDNA library tor additional clones, 
and the entire sequence was recovered by -walking" 
15 techniques. In this way. eight additional plasmids were 
recovered which span the entire 12.5 kb genome of the 
virus. The positions of the overlapping insects are 
shown in Figure 1. As shown in Figure 1. the pDT28 
clone occupies a roughly central portion of the genome. 
20 The 8 additional plasmids recovered from the cDNA 

library in a manner analogous to that described above, 
but using the appropriate overlapping 
sequence-containing clone as probe, were grown in Ej_ 
coll . and the plasmid DNA isolated. The inserts were 
25 sequenced, and verified to contain overlapping 

portions. The results of this sequencing are shown in 
Figure 2. which provides the entire genomic RNA sequence 
ascertained from the inserts. 

The orientation shown in Figure 2 was 
determined by subcloning pDT2B into M13 into both 
orientations, labeling the resultant phage, and using 
the labeled phage as a probe against RNA known to be of 
positive polarity. This was done by spot hybridization 
on nitrocellulose filters using uninfected cell RNA. 
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infected cell RNA # and template viral RNA. The infected 
cell RNA and template RNA should be of positive 
polarity. Therefore, the M13 orientation hybridizing to 
infected cell RNA and viral RNA contains a negative 
5 sense strand, and from this information, the 5 • to 3' 
sequence of inserts from pCT63 to pCT185 could be 
deduced. 

This conclusion was confirmed by analysis of 
the sequence of pCT63. which indicates its capability to 
10 form the expected hairpin structure at the 5 1 end. and 
by the absence of additional clones in the cDNA library 
having additional 5' sequences to that of pCT63. 

* * 

D * 3 * Expression of S equences Encoding BGal-BPV Fusions 
in E. coli 

15 Twelve portions of the BDV genome were obtained 

as follows: (1) the entire cDNA sequences per se. (2) 
products of restriction cleavage (with Pstl or BamHl or 
both) of the foregoing cDNAs. and (3) a ligated sequence 
obtained by ligating the pCTlB5 cDNA with a fragment of 

20 another. (See the table below.) These portions were 
used to encode the BDV portions of the fusion proteins. 
These eleven BDV protein encoding sequences were cloned 
into one of or a mixture of pUR29jO, pUR291, and pUR292» 
which contain restriction sites, e.g.. BamHl and Pstl 

25 sites in all three possible reading frames with the 
B-gal codons, so as to encode fusion proteins at the 
C-terminal portion of the B-galactosidase protein 
(Rut her. U.; et al. Embo J (1980) 1791-1794) . Since 
all three possible reading frames are provided for the 

30 restriction sites used, the correct reading frame in at 
least one of the vectors for the fusion protein is 
assured. Table l summarizes the vectors prepared and 
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the BDV sequence contained in each. Nucleotide numbers 
are as indicated in Figure 2. 

Table 1 



10 



15 



20 



25 



30 



Name 

pUBVDl 

pUBVD2 

pUBVD4 

pUBVDS 

pUBVD6 

PUBVD7 

pUBVDB 

PUBVD9 

pUBVDIO 

pUBVDll 

PUBVD12 

PUBVD13 



pUR 

Parent 

pUR290 

pool 

pUR292 

pool 

PUR290 

pool 

PUR292 

PUR292 

PUR290 

PUR290 

pUR291 

PUR290 



BDV Insert 
Derived from 

pCT 6 3 
pCT 3 6 

per lb 3 

pDT28 
pCT174 
pCT174 
pDT65 

pDT65 + pCTlBS 
PCT185 
pCT185 
pCT 6 3 

PDT28 + pDT17 



BDV Nucleotides Con- 
tained in pUBVD Vectors 
(Numbers as in Fig. 2) 



2607 

2574 

4560 

-6450 

-10718 

-10811 

-10811 

-12470 

■12457 

-12457 

-1397 

—7800 



1397- 
2037- 
2955- 
5650- 
7225- 
-9500- 
10442- 
10442 
11030 
11405 
597 
-6000 
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Each of the twelve cDNA sequences was mixed 
with T4 ligase in the presence of PstX-digested mixtures 
of pUR290. 291. and 292 (or of one of these if the 
correct reading frame was deduced) and the ligation 
mixture transformed into E. coll strain D1210 (Lacl* 
mutant of HB101) to Amp*. Successful transf ormants 
were confirmed by hybridization with labeled insert, and 
isolated plasmid DNA was analyzed by restriction 
analysis to confirm correct orientation. Expression was 
induced in successful transf ormants containing correctly 
oriented insects by treating with IPTG (1 mH) on L-broth 

* 

medium containing 40 ug/ml ampicillin. Three hours 
after induction, the cells were harvested, and lysed by 
sonication. The fusion proteins were produced as 
inclusion bodies, and the inclusion bodies were 
harvested by the method of Klempnauer. et al. Cell 
(1983) 33:345-355. and stored at -20°C suspended in 10 
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mM Trie (pH 8.0). 1 tnM EDTA. Approximately 10-30 mg 
inclusion body proteins were obtained per ml of 
culture. 

Characterization of the Fusion Proteins 

The fusion proteins were characterized as to 
their antigenic properties both in insoluble and 
solubilized forms. 

Inclusion body proteins solubilized in IX SDS 
or 7 M urea followed by dialysis to a final 
concentration of 1 mg/ml are unreactive with sera from 
infected calves or from rabbits infected with purified 
virus. 

Preparation of Antisera . Both solubilized and 
unsolubilized inclusion bodies were injected into 
rabbits using perl-lymph nodal immunizations with 500 
ug protein emulsified with Freund's complete adjuvant, 
with boosting every 4 weeks (IH injection of 500 jig 
emulsified in adjuvant) and bled 10 days after boost. 
Control antisera were prepared from infected calves or 
from rabbits injected with purified virus. The antisera 
were tested for immunoactivity by EL ISA and immunofluo- 
rescence, and by Western blot and immunoprecipitation. 

Western blot and immunoprecipitation yield 
complementary Information with respect to reactivity. 
In immunoprecipitation* the native protein mixture is 
reacted with the test serum and the immunoprecipitate 
subjected to SDS-PAGE. Therefore, immunoprecipitation 
assesses immunoreactivity with the native protein. 

However, in the Western SDS blot procedure, 
PAGE is performed before the antisera are tested for 
precipitation with the proteins on the gel. Therefore, 
Viestern blot assesses reactivity with denatured 
protein. ' 



0208672 

-34- 

The results of these procedures are given 

below. 

Results . The control antisera were 
immunoceactive with respect to proteins extracted from 
S the virus pellet produced on BEKI cells, and shoved 
immunoprecipitation with the 76 kD protein presumed to 
be the major antigenic component, as well as minor 
components presumed to be, at least in part* virion 
proteins having molecular weights of 36. 43, 47. 51 and 
10 56 kD. No immunoprecipitation occurred when the' control 
antisera were tested on Western blot. Control antisera 
against Infection thus react with antigens in the native 
protein, but not after denaturation. 

Immunoprecipitation and Western Blot . Most of 
15 the antisera formed in response to the fusion proteins 
were negative both in assay by immunoprecipitation and. 
like the control antisera. on Western blot* 

However, there were exceptions. The antiserum 
generated by fusion protein 7 immunoprecipitates the 36 
20 itD protein from BEKl-grown virus and reacts by Western 
blot to the 76 kD and 51 kD bands. Antiserum from 
fusion 5 immunoprecipitates 3 sizes of proteins: 64. 98, 
and 105 kD. sizes not precipitated by control antisera. 
Antiserum from fusion 9 precipitates a SB kD band, also 
25 not precipitated by the control antisera. The 

signif ican«rof KW of the materials is not clear since 
it is not clear which, if any. of these proteins 
represent glycosylated materials with corresponding 
alterations in molecular weight. 
30 ELISA (carried out according to the procedure 

of Bartlett. et al. in Peptides of the Biological 
Fluids . H. Peeters, ed.. Pergamon Press. Oxford. 1976, 
24:767-770) used partially purified virus as antigen. 
Only the antiserum prepared against fuqion protein 7 was 

» 
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positive at a 1:40 titer; serum prepared against fusion 
proteins 5 and 11 had titers of 1:4 and 1:8. 
respectively. Nonimmune sera were negative. 



5 live or fixed infected cells. The antiserum prepared 
against fusion protein 11 was slightly positive in 
immunore^ctivity with live cells: on cells fixed with 
methanol, acetone, or formaldehyde, serum prepared from 
fusion protein 7 gave the same strong response as 
10 control antisera from the infected animals, whereas 

antisera 5 and 3 were weakly positive against proteins 

extracted from the virus pellet produced on BEKI cells. 



Immunof luorescence was conducted using labeled 
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Clalms 

1. A nucleotide sequence substantially 
identical with that of the BDV genome as represented in 
Figure 2. 

2. A nucleotide sequence encoding at least one 
vital polypeptide substantially identical with that 
encoded by the BDV genomic sequence shown in Figure 2. 

3. A nucleotide sequence derived from a 
portion of the BDV genomic sequence shown in Figure 2. 

4. A recombinant expression system capable, in 
a compatible host cell, of effecting the production of a 
BDV related protein, which system comprises a DNA 
sequence derived from the nucleotide sequence of claim 1. 

5. A recombinant expression system comprising 
a coding portion derived from the sequence of claim 1 
operably linked to a control sequence compatible with a 
desired host. 

6. A recombinant vector which comprises the 
expression system of claim 4* 

7. A recombinant .vector which comprises the 
expression system of claim 5. 

B. Recombinant host cells transformed with the 
vector of claim 6. 



9. Protein produced by the cells of claim 8 
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10. The system of claim 5 which further 
includes upstream of said DNA sequence, and in reading 
frame therewith, a fused nucleotide sequence encoding a 

host protein or portion thereof. 

5 11. The system of claim 10 wherein the fusion 

DNA sequence encodes an N-terminal portion of 
B-galactosidase. 

12. Recombinant host cells transformed with a 
vector comprising the system of claim 10. 

10 13. Protein produced by the cells of claim 12. 

14. A particle immunogenic against BDV 
infection which particle comprises a polypeptide having 
an amino acid sequence capable of forming a particle 
when said sequence is produced in a eucaryotic host, and 

15 a neutralizing epitope of BDV. 

15. The particle of claim 14 wherein the 
particle forming amino acid sequence is derived from 
hepatitis B virus. 

* 

16. The particle of claim 15 wherein the 
20 particle forming amino acid sequence is derived from 

HBsAg. 

* 

17. A vaccine effective against bovine 
diarrhea virus which comprises the polypeptide of claim 
9. 

25 IB. A method for preparing an anti-BDV vaccine 

which comprises culturing the cells of claim 8 and 
recovering the recombinant peptide. / 
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Mapping of 9 BDV cDNA clones which span the whole genome. Clones were 
derived from oligo dT primed cDNA ( DT clones) or from randomly primed 
cDKA using calf thymus oligonucleotides (CI clones). Names of clones 
are as follow: l*pCT63; 2-pCT36; 3-pCTH*; 4«pCT70; 5-pDT28; 6-pDT17: 
7"pCTl74; *8«pDT55; 9"pCXj^.85j lo*pCf$0» * 

cDNA fragments used to "construct expression vectors for E.coli by 
fusion to the E.coli B-galactosidase gene. " 
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1 ATGTATACGAGAATTT6CCTAACCTCGTATACATATTGGGCATTCTAAAAATAAATTA6GCC 
CATATGCTCTTAAACGGATTGGAGCATATGTATAACCCGTAAGATTTTTATTTAATCCGG 

56 stul. 61 astll, 

63 TAAGGGACAAATCCTCCTTAGCGAAGGCCGAAAAGA6GCTAGCCAT6CCCTTAGTAGGAC 
ATTCCCTGTTTAGGAGGAATCGCTTCCGGCTTTTCTCCGATC6GTACGGGAATCATCCTG 

100 nhsl t 

123 TAGCAAAACAAGGAG6GTAGCAACAGTG6TGAGTTCGTTGGATGGCT6A 

ATCGTTTTGTTCCTCCCATCGTTGTCACCACTCAAGCAACCTACC6ACTTCGGGACTCAT 

183 CAGGGTAGTCGTCAGTGGTTCGACGCTTCGTGTGACAA6CCTCGAGGTGCCAC6TGGA 

GTCCCATCA6CAGTCACCAAGCT6CGAAGCACACTGTTCG6AGCTCCACGGTGCACCT6C 

223 Hhol t 

243 AGGGCATGCCCACAGCACATCTTAACCT6AGCGG6GGTC6TTCAG6T6AAA6C6GTTTAA 
TCCCGTACG6GTGTCGT6TAGAATTGGACTCGCCCCCAGCAAGTCCACTTTCGCCAAATT 

246 sphl, 

303 CCAACCGCTACGAATACA6CCTGATAGGGTGCTGCAGAG6CCCACTGTATTGCTACTAAA 
GGTT6GCGATGCTTATGTCGGACTATCCCAC6ACGTCTCC6GGT6ACATAAC6AT6ATTT 

334 pstl. 

MetGluLeuIleThrAsnGLuLeuLeuTyrLysThrTyr 
AATCTCTGCTGTACATGGCACAT6GA6TTGATTACAAATGAACTTTTATACAAAACATAC 
TTAGAGACGACATGTACCGTGTACCTCAACTAATGTTTACTTGAAAATATGTTTTGTATG 

LvsGlnLysPraAlaGlyValGluGluProValTyrAanGlnAiaGlyAspProLeuPhe 

423 AAACAAAAACCCGCTGGAGTGGAGGAACCAGTATATAACCAA^ 

TTTGTTTTTGGGCGACCTCACCTCCTTGGTCATATATT6GTTC6TCCACT6G6AAACAAA 

468 bst£2 t 

GlyGluArgGlyValValHisProGlnAlaThrLeuLvsLeuProHijl-VfAr^ 
483 GGtGAGAGAGGAGT6GTTCATCCGCAGGCGACGCTAAAACTGCCA 

•CCGCTCTCTCCTCACCAAGTAGGCGTCCGCTGCGATTTTGAC6GTGTATTTTCTCCCCTC 

ArqGiuValProThrAsnLeuAlaSorLttuProLvBAraGlyA^ 
543 CGCGAAGTACCTACTAATCTGGCGTCTCT6CCAAAAAGaG6T6ACT6cA6^ 

GCGCTTCATGGATGATTA6ACCGCAGAGACGGTTTTTCTCCACT6ACGTCCAGCCCATTG 



363 



603 



663 



587 pstl. 

* 

SerLysGlYProValSerGlYlleTYrLeuLvBProGlyProLjuPheTvrGl^ 
AGCAAGGGACCCGTGAGTGGAATCTACCTGAAACCGGGbCCG 

TCGTTCCCT6GGCACTCACCTTA6AT66ACTTT66CCCC6GCAATAAGATGGTCCTAAT6 

LysGlyProValTyrHisArgAlaPpoLeuGluPhePheGlnGluAlaSerMatCysGlu 
AAAGGACCCGTCTATCATAGAGCTCCATTGGAGTJCTTTCAGGAAGCCTCTATGTGT^^ 
TTTCCTGG6CA6ATA6TATCTCGAGGTAACCTCAAGAAAGTCCTTC66A6ATACACACTC 

* 

682 sacl v 

A ThrThrArgArqlleGlYArqValThrGlySerAM^ 
723 ACAACTAGAAg2aTTGGGAGAGTAACTGGTAGT6ATGGTAAATTGTACCACATTTA^ 

TGTTGATCTTCCTAACCCTCTCATT6ACCATCACTACCATTTAACAT66T6TAAATACAC 

CysIleAspGlyCyslielleValLYsS^AlaThrLy^ 

783 tgcatagatggAtgcataatagttaAgagcgccacaaAaw 

ACGTATCTACCTACGTATTATCAATTCTCGCGGTGTTTTATAGTAGTTTTCCATGAGTTT 



794 avaS, 



843 



TrpValHisAsnLysLeuAsnCvsProLeuTrpVal^^ 
TGGGT CCACAACAAACT AAATTbCCCTCTATGGGTTTCA^ 

ACCCAGGTGTTGTTTGATTTAACGGGAGATACCCAAAGTTC6AC6AGGCT6TGTTTTCGT 
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GluGlyAlaTbrArgLysLysGlnGl nLysProAs pArqLeuG luLytGl yArqMe tLyft 
03 G AAGGbGCG A CAAGAAAGAAGCA ACAAAAACCAGA i AGuCTGGAAAAGGGGAGbATGAAG 
. CTTCCCCGCTGTTCTTTCTTCGTTGTTTTTGGT CTAT CCGACCTTTTCCCCTCCTACTTC 



I leThrProLysGluSerGluLysAspSerLysThrLysProProAspAlaThrlleVal 
63 ATAACTCCT A A AGAGT CGGAGAA AGAT AGTAAGAC C AA ACCGCCAGA I GCTACGAT AGTG 
TATTGAGGAT TTCT CAGCCTCTTTCTATCATTCTGGTTTGGCGGTCTACGATGCTATCAC 

Va 1 AspGlyValLysTyrGlnVaiLysLysLysGt yLyslleLysSerLysAsnThrGIn 
23 GT AG ATGGTGT CAAAT AT CAGGTAAAGAAAAAAGGGAAA ATCAAGAGT AAGAATACCCAG 
CATCTACCACAGTTTATAGTCCATTTCTTTTTTCCCTTTTAGTTCTCATTCTTATGGGTC 

AspGlyLeuTyrHisAsnLysAsnLysProGlnGluSerArgLysLycLeuGluLysAla 
83 GACGGTTTGT ACCACAACAAAAATAAACCTCAAGAGTCACGCAAGAAACTAGAGAAAGCC 
CTGCCAAACATGGTGTTGTTTTTATTTGGAGTTCTCA6TGC6TTCTTTGATCTCTTTCGG 

1140 bgll v 

LeuLeuAlaTrpAlaVal I 1 eAl aLeuValLeuPheGlnVal AlaValGl yGluAsnl 1» 
43 CTGTTGGCATGGGCAGTAATAGCCTTGGTTTTGTTTCAAGTCGCAGTG6GAGA6AACATA 
GACAACCGTACCCGTCATTATCGGAACCAAAACAAAGTTCAGCGTCACCCTCTCTTGTAT 

ThrG InTrpAsnLeuG 1 nAspAsnGlyThrGluGly 1 leGlnArgAlaHatPh«Gl nAra 
03 ACACAATGGAACTTACAAGACAATGGGACGGAAGGAATACAACG6GCCATGTTCCAAAGA 
TGTGTTACCTTGAATGTTCTGTTACCCTGCCTTCCTTATGTTGCCC6GTACAAGGTTTCT 

Gl vVal AsnArgSerLeuHisGlyl leTrpProGluLysl ItCysThrGlyValProSvr 
63 GGCGTAAATAGAAGTCTGCATGGGATCTGGCCAGA6AAAATCTGTACAG6TGTCCCCTCC 
CCGCATTTATCTTCAGACGTACCCTAGACC6GTCTCTTTTA6ACAT6TCCACAGG66AG6 

1270 ball, 

HisLeuAlaThrAspThrGluLeuLysAlal l«HiB61yH*tH«tAt pAlaSsrGluLys 
123 CACTTGGCCACTGATACAGAACTGAAGGCAATTCATGGTATGATGGATGCTAGCGAGAAG 
6T6AACCGGTGACTATGTCTTGACTTCCGTTAAGTACCATACTACCTAC6ATCGCTCTTC 

1327 ball, 1333 tthllll, 1371 nhei , 

ThrAsnTyrThrCysCysArgLeuGlnArqHisGluTrpAsnLysHiftGlyTrpCysAsn 

563 acaaattacacatgctgcaggctccaacgccatgagtggaacaagcatggTtggtgcaat 
tgtttaatgtgtacgacgtccgaggttgcggtactcaccttgttcgtaccaaccacgtta 

1397 pstl, 

TrpTyrAsnl leGluProTrpIlBValLeuf1etAsnLysThr61nAlaAsnL»uAlaGlu 
,43 TGGTACAATATTGAACCTTGGATTGTTCTCATGAATAAAACCCAAGCCAACCTT6CTGA6 
ACCATGTTATAACTTGGAACCTAACAAGAGTACTTATTTTGG6TTCGGTT66AACGACTC 

1449 sspl, 

GlyGlnProProArgGluCysAlaValThrCysArgTyrAspArgAftpSerAspLeuAsn 
S03 GGTCAGCCACCAAGbGAGTuT£CCGTTACATbCCGGTATGAC€GAGATA6TGACCTAAAT 
CCAGTCGGT6GTTCCCTCACAC6GCAAT6TAC6GCCATACT6GCTCTATCACTG6ATTTA 

Va 1 Va 1 ThrG 1 nA 1 a Arg AsnSerPraThr Pr oLeuThrGl y CysLysLysGlyLy iAtn 
SA3 6TAGTAACACAAGCTAG6AACAGCCCCACACCATTGACAGGCTGCAAGAAAG6CAA6AAC 
CATCATTGTGTTCGATCCTTGTCGGGGT6TGGTAACT6TCCGAC6TTCTTTCC6TTCTTG 

PheSerPheAlaGi yValLeuValGlnGlyProCysAsnPh«Glull«AlaValSorA»D 

623 ttctcctttgcaggtgtgtjg6tacaaggbccttbcaactttgaaata6ctgtaa6t6at 
aagaggaaacgtccacacaXccatgttcccggaacgttgaaactttatcgacattcacta 

ValLeuPheArgG luHisAspCysThrSerVa 1 UeGinGlyThrAlaHiftTyrLeuVal 
663 GTGCTGTTTAGAGAGCACGATT6CACAAGTGTGATTCAAGGCAC6GCTCACTATCT6GTA 
CACGACAAATCTCTCGTGCTAACGTGTTCACACTAAGTTCCGTGCCGA6TGATAGACCAT 

AspGlyMetThrAsnSerLeuGluSerAlaArgGlnGlyThrAlaLyfiLsuThrThrTrp 
743. GACGGbATGACCAATTCTCTAGAAAGTGCCAGGCAAGGGACCGCAAAGTTAACTACTTGG 
' CTGCCCTACTGGTTAA6AGATCTTTCAC6GTCCGTTCCCTG6CGTTTCAATT6AT6AACC 

1760 xbal, 1790 hpal f 

tSy5 l X Ar 9 GlnLeuL Y sL y sLeu61 y L y«Ly*LeuGluA»nLymS«rLy«ThrTrpPh» 
603 TTGGGTAGGCAGCTTAAGAAACTAGGGAAGAAACT6GAAAACAAGA6TAA6ACATGGTTT 
AACCCATCCGTCGAATTCTTTGATCCCTTCTTT6ACCTTTTGITCTCATTCT6TACCAAA 

1615 a£lll, ' t 

5i^ii?I yrAlaAl i5S rProT Y rC y* ei «ValGiuArgArgL«u61yTyrIi»TrpTyr 

663 ggggcatatgcagcctctccctactgcgaggtagaacgSagScttggttacatctggtat 
ccccgtat acgt cgg agagggatgacgctccat cttgcct cc6aaccaatgt agaccata 



1923 



2103 



2223 



2263 
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ACAAAGAATTGCACCCCTGCCTGTT TACCAAAAAAT ACAAAGAT CGTTGG^rrrrr¥ at? 
TGTTTCTTAAC6TGGGGACGGACAAATGGTTTTTTATGTTTCTAGCAACCGGM 

AAGCTGTGGTTACGCCTCCTACCATTTTATGACGTACTCTACCCCCC6GT^ 
2043 GTGCTACTACTCTCA6TG6TAGTGCTTTCCGA?TTCGCTCCAGAG^ 

cacgatgatgagagtcaccatcacgaaaggctUag^ 

ATAAATTAAGATGTAAA6AGGTA6G6TGTTCCTGTGT6AC^ 

2,63 f?s8aiw?aa?ss8i8 

I TG6TTGATTTGGA6TGGCATCCTGA6TGTTGTCGACTTCATTATGGACCCA6TCAAACC 
219* pvull, 2209 tthllll 



TTAAACCCGTTTATACAAACATATTCTGGTCTAACCACCGSAATACTCTGTCGGTG^ 
2250 batXl, 



gatcacaaacttctccacccagttcactagtcctatcagAactcccgaa^ 

2306 bell, 2334 bglll, 

ItSiES! lSlEE T l! rA 1 * A 1 *ThrThrThrA 1 aPhaLeuVa 1 CvaLsuValLvsV* l u< l 

2343 ££SSS£$!I t 5 6accgctgcta cgactactgcattcctg6t^ 

tgcgcgtaaacctggcgacgatgctgatgacgtaaggaccataca^ 

,.„, 5?9SiZfi?y;)V5yS ln 5 1 >: Ilel - ,uTr B , -«ulI«L«uIl«ThrGlyAlaGln6IvL«u 
AG«6GtCAA6TGTTGCAA6GtATAC 

TCTCCGGTTCACAACGTTCCGTATGACACCAACTATGAGTATT6TCCCC6TGTTCCCGAG 

2S23 Higug^ 

CCCCGATGCCCGGAGTGGTGAGTCACCATACTTATGAGCCTACCCTACGCC6AC6TCCT6 
"257* patl, 

IrrrrirTlr?l^$n5PSZ5 Ly « e, X 6I y 6luI1 »»-yaTyrLeuIl»ThrCya61uArQ 
TG^CCTCAACATcIrlSpI^ 

1 bttt TCAACATCACACCACATTTCCACCTCTCTA6TTTATAGATTAAT6TACACTCTCC 
6AAGCCAGfiTATCTGGCCATTCTACAC^ 

CTTCGGTCCATAGACCGGTAAGAT6T6TGCTCTCGGGACGGCTGCA6ACATCATAAACTT 
26*7 batXI, 26S6 ball, 26fl«, aatll, 

^mi !:X5il;n?i??Si? , r y, S lu 5 lnfiluA «P v *l v «»61u M «tAapAapAsnPh«GluL«u 
2703 AAAA TCATAGATGGAAAAGAACAAGAGGACGTA6TGGAAATGGATGATAACTTTGAACTC 

TTTTAGTATCTACCTTTTCTTGTTCTCCTGCATCACCTTTACCTACTATTG^ 
5l3fi:5"£y» ,, roCysA«pA 1 «LyaProL«uVa I Arq6 1 yLyaPh»AanThrThrL»uL«u 

CCAGAAACGGGCACACTACGATTTG6GAACCATTCCCCTTTTAAATTATGTTGT6AAGAC 

ti?SiX^? A l;£!j55!!J"»tValCY»ProIl»61yTrpT 
2823 AA TG66CCAGCCTTCCAGAT6GTTT6CCCTATAGGATG6ACAGGfo 

TTACCCG6TCGGAAGGTCTACCAAAC6GGATATCCTACCTGTCCTTGACACTCA6ACACA 

• 

2883 CACTGGTCCAATA*GGAYACGTTA^ 

GTGACCAG6TTATTCCTATGCAATCGGTACT6GCAACATGCTTGTATGTTCTCCGTGTCC 

29*0 *tui. . 

m-k ????5?^??tS^^9S lnG lxCy*Il«ThrGlnLy»V«l I 1 e61 vG 1 yA«pL»uTyrA«p 

2 3 SSIIIS^S"II$5§^AGGtT6CATTACCCAGA*AGTCATCG66GG^ 

GGAAAGGGGAAATCCGTTCCGACGTAATG6GTCTTTCAGTAGCCCCCTCT66AGATGCTG 
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S003 : T GTGCCTTGGGAGG6AACT6GACTTGTGT AC CGGGGGACAT ACT ACGAT ATGTAGATGGG 
ACACGGAACCCTCCCTTGACCTGAACACATGGCCCCCTGT ATGATGCTATACATCT ACCC 

* 

ProVa IGluSerCysLysTr pCysGlyTyrLysPheHLsLysSerGluG lyLeuProHis 
5063 CCT GTCGAGT CTTGCAAGTGGTbTGGTT ACAAGTTTCATAAAAGTGAGGGTCTGCCACAC 
GGACAGCTCAGAACGTTCACCACACCAATGTTCAAAGTATTTTCACTCCCAGACGGTGTG 

PheProI 1 eGl yLysCysLysLeuLysAsnGluSerGl yTyrArgGlnValAspGluThr 
3123 TT CCC A ATTGGC AAGTb CAAGCTGAAGA ATGAAAGTG6CT ACAGACAAGTAGAiGAGACC 
AAGGGTTAACCGTTCACGTTCGACTTCTTACTTTCACC6AT6TCTGTTCATCTACTCTGG 

SerCysAsnArgAspGl yVa 1 Alal 1 eVa 1 ProThrG 1 ySerVa ILysCysLysI leGl y 
3183 TCTTGCAACAGAGACGGTGTGGCTATAGTACCAACTGGTTCGGTGAAATGCAAGATAGGG 
AGAACGTTGT CTCTGCCACACCGAT ATCATGGTTGACCAAGCCACTTTACGTTCTATCCC 

AspThrValValGl nVa 1 1 I eA I aMetAs pAspLy sLeuG I y ProMetProCysArg Pro 
3243 GACACAGTGGTGCAAGT CAT AGCAATGGATGATAAGCT AGGGCCT ATGCCTTGCAGACCA 
CTGTGTCACCACGTTCA6TATCGTTACCTACTATTCGATCCCGGATACGGAACGTCTGGT 

3301 ndel, 

TyrGl ul lei leProSerGluGlyProValGluLysThrAlaCysThrPheAsnTyrThr 
3303 T ATGAAATCATT CCCAGTGAGGGGCCGGTAGAAAAGACGGCATuTACCTTCAACTACACA 
ATACTTTAGTAAGGGTCACTCCCCGGCCATCTTTTCTGCC6TACATGGAA6TT6ATGTGT 

3306 xnnl . 

LysThrLeuLysAsnLysTyrTyrGluProArqAspAsnTyrPheGlnGlnTYrHetLeu 
3363 aaaacattaaagaacaagtattatgagcctagSgat AATTATTTCCAACAATACATGTTA 
TTTTGTAATTTCTTGTTCATAATACTCGGATCCCTATTAATAAAG6TT6TTATGTACAAT 

3390 avr2, 

LysGlyGluTyrGl nTyrTrpPhoAtpLeuGluI leThrAspHisHisArgAspTvrPhe 
3423 AAAGGGGAGTACCAATATTGGTTTGACCTAGAGATCACTGACCACCACCGGGATTACTTC 
TTTCCCCTCATGGTTATAACCAAACT66ATCTCTA6TGACTGGTGGTGGCCCTAATGAAG 

3436 sspl, 

AlaGluSerLeuLeuVal HeValVal AlaLeuLeuGlyGlyArqTvrValLpuTrpLeu 
3483 GCTGA6TCCCTACTG6TGATAGTGGTTGCACTCCTGGGt6GTAGGTAC6TGCTCTGGTTA 
CGACTCAGGGATGACCACTATCACCAACGTGAG6ACCCGCCATCCATGCAC6AGACCAAT 

LeuValThrTyrHetlleLeuSerGluGinnetThrSerGlYArgProValTrpAlaGly 
35 A3 CTGGTTACATATATGATCCTATCAGAACAAATGACCTCGGGACGTCCAGTATGGGCAGGT 
GACCAATGTATATACTAGGATAGTCTTGTTTACTGGAGCCCTGCAGGTCATACCCGTCCA 

3583 aatll,- 

GluIleValMetMetGlyAsnLeuLeuThrHisAspSerlleGluValValThrTyrPhe 
3603 GAAATAGTGAT6ATGGGCAACCTGCTAACACATGACAGTATT6AAGTGGTGACTTATTTC 
CTTT ATCACTACTACCCGTTGGACGATTGTGTACTGTCATAACTTCACCACTGAATAAAG 

LeuLeuLeuTyrLeuLeuLeuAroGluGluAsnlleLysLYsTrpVallleLeulleTYr 
3663 TTACTACTATACCTACTACTAAGAGAGGAAAACATCAAAAAATGGGTTATACTTATATAC 
AAT6ATGATATGGATGAT6ATTCTCTCCTTTTGTA6TTTTTTACCCAATATGAATATATG 



3723 



KisIIelleValHetHisProteuLysSerValThrValllBLauLeuMetValGlyGly 
CACATCATAGTAATGCACCCACTAAAATCAGTGACGGTGATACTGCTAATG6TTG6A6GG 
GTGTAGTATCATTAC^TGGGTGATTTTAGTCACTGCCACTATGACGATTACCAACCTCCC 

MetAlaArgAla61uProGlyAlaGlnSerPheLeu61uGlnValA8pL;uS«rPheScr 
3783 ATGGCAAG6GCAGAACCAGGCGCCCAGAGCTTCCTAGA6CAG6TGGACCTGAGTTTTTCA 
TACCGTTCCCGTCTTGGTCCGCGGGTCTCGAAGGATCTCGTCCACCTG6ACTCAAAAAGT 

3801 narl , 

MetlleThrLeuIleValValGlyLeuVall^ 
3843 ATGATCACGCTCATTGTAGTAGGTCTGGTCATTGCCAGGCGCGAC^ 

TACTAGTGC6AGTAACATCATCCAGACCAGTAACGGTCC6CGCTGGGGTGACACCACGGT 

3844 bell, 3902 spel t 

LeuValThrlleValAlaAlaLeuAroValThrGlvLauGlvPheGW 

3903 CTAGTCACAATAGTTGCAGCACTGAG66TAACGGGACT'AGGCTTTG66^ 

GATCAGTGTTATCAACGTCGTGACTCCCATTGCC CTGATCC6AAACCC6GGCCTCACCTA 

3948 apal f 
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Va 1 A 1 artetAl aVa 1 LeuThrLeuThrLeuLeuMet I leSerTyrVa I ThrAtpTyrPhe 
963 GT AGCTATGGCAGTCCT AACCTTGACCCTACTGATGATTAGTT ATGTGACAGACTACTT C 
• C AT CG AT AC CGT CAGG ATT GGAACTGGGATGACTACTAAT CAATACACTGT CTGATGAAG 



023 



083 



143 



203 



ArgTyrLysArgTrpLeuG 1 nCys 1 1 eLeuSerLeu! leAl aG I yVa 1 PheLeul leArg 

aggtacaaaagGtggctacaatgtatcctcagcttaatagccggggttttccttatacga 
t cc atgt tt tccaccgatgtt acataggagt cg aatt atcggccccaaaaggaat atgct 

SerLeuLysHi sLeuGl yG 1 ul 1 eG 1 uThrProG luLeuThrl I eProAsnTrpArqPro 
AGCCTTAAACATCTGGGCGAGATTGAGACCCCTGAGCTGACCATACCGAACTGGAGGCCA 
TCGGAATTTGTAGACCCGCTCTAACTCTGGGGACTC6ACTGGTATGGCTT6ACCTCCGGT 

LeuThrPhel I eLeuLeuTy rLeuThrSerAiaThrVa 1 Va IThrArgTrpLysVal Asp 
CT AACCTTCATACT ATTGT ACCTGACTT CAGCAACAGTTGTCACACGATGGAAAGTTGAC 
GATTGGAAGTATGATAACATGGACTGAAGTCGTTGTCAACAGTGTGCTACCTTTCAACTG 

1 1 eAlaGlyl 1 eLeuLeuG 1 nG I yProGlnSerPheCysOP 

ATAGCTGGCATATTACT6CAAGGGCCCCAATCCTTCT6CTGATTGCCACCTATGGGCT 
TAT CGACCGTAT AAT6ACGTTCCCGGGGTTAGGAAGACGACTAACGGTGGATACCC6A 

422* apal, 



.261 GACTTCCTGACCCTTGTATTGAT CCTGCCCACCCACGAATTAGTCAAGTTGTACTACCTG 
CTGAAGGACTGGGAACATAACTAGGACGGGTGGGTGCTTAATCAGTTCAACATGATGGAC 

r321 AAGACCGTCAA6ACT5ATGT6GAAAAGAGTTGGCTAG6GGGG6TG6ACTACAA6ACAATT 
TTCTGGCAGTTCTGACTACACCTTTTCTCAACCGATCCCCCCCACCT6AT6TTCTGTTAA 

MetAspGluSerGlyGluGlyValTyrLeuPheProSerLysGln 
;381 GGCTCTATTTATGATATGGATGAAAGTGGAGAGGGCGTGTACCTTTTCCCATCCAAACAG 
CCGAGATAAATACTATACCTACTTTCACCTCTCCCGCACATGGAAAAGGGTAGGTTTGTC 

AsnGl yLysLysAsnVaiSer I I eLeuLeuProLeul leArgA laThrLeul leSerCys 
^441 AATGGCAAGAAAAATGTCA6CATACTCTTGCCCCTCATTAGAGCTACACTAATAAGCTGT 
TTACCGTTCTTTTTACAGTCGTATGAGAACGGGGAGTAATCTCGATGTGATTATTCGACA 

4497 tthllli; 

X 1 eSerSerLysTrpGl nMetValTyrHetAlaTyrLeuThrLeuAspPheMetTyrTyr 
4501 ATCAGCAGCAAATGGCAGATGGTGTACATGGCTTACCTAACCTTGGACTTTATGTACTAC 

TAGTCGTCGTTTACCGTCTACCACATGTACCGAATGGATTGGAACCTGAAATACAT6ATG 

I leHisArqLysVal IleGluGluIleSerGiyGIyThrAsnVallleSerArgVallle 
4561 AT ACAC AGAAA6GTTATAGAAGAGAT CTCAGGGGG&ACCAATGTGATATCTA6GGTGAT A 
TATGTGTCTTTCCAATATCTTCTCTAGAGTCCCCCGTGGTTACACTATAGATCCCACTAT 

4583 bglll, 4605 ecorS , 

4 

A 1 aAl aLeuI 1 eG 1 uLeuAsnTrpSerMetGl uG luGl uGluSerLysGlyLeuLysLys 
4621 GCAGCACTCATAGAGCTAAACTGGTCTATGGAAGAAGAAGAAAGCAA6GGCTTAAAGAAG 
CGTCGTGAGTATCTCGATTTGACCAGATACCTTCTTCTTCTTTCGTTCCCGAATTTCTTC 

PhePhelleLeuSerGlyArgLeuLysAlaLeullelleLysHisLysVal ArgAsnGln 
4681 TTTTTTATACTATCTGGGAGGTTGAAGGCCCTTATAATAAAGCATAAGGTTAG6AACCA6 
AAAAAAT ATGATAGACCCTCCAACTTCCGGGAATATTATTTCGTATTCCAATCCTTGGTC 

ThrValALaSerTrpTyrGlyGluGluGluValTyrGlyHetProLvsValValThrll« 
4741 ACCGTAGCAAGCTGGTATGGGGAGGAAGAA6TCTACGGCAT6CCAAAAGTAGT6ACCAT A 
TGGCATCGTTC6ACCATACCCCTCCTTCTTCAGATGCC6TAC6GTTTTCATCACTG6TAT 

4778 sphl f 

IleArqAlaCysSerLeuAsnLysAsnLysHisCysllclleCysThrValCysGluAla 
4801 ATAAGGGCTTGCTCACTAAACAAGAACAA6CATTGCATAATATGCACAGTAT6T6A6GCT 
TATTCCCGAACGAGTGATTT6TTCTT6TTCGTAACGTATTATAC6TGTCATACACTCC6A 

LysLysTrpLys61yGlyAsnCYsProLysCysG.yArqHi»GlyLysProll«ThrCy« 
4661 AAGAAGTGGAAGGGTGGCAACTGCCCTAAATGCGGCCGCCACGGGAAGCCCATCACTTGT 
TTCTTCACCTTCCCACCGTTGACGGGATTTACGCCGGCGGTGCCCTTCGGGTAGTGAACA 

4893 x»a3, 

GlyMotThrLeuAl aAspPheGluGluArqHisTyrLytArgl lePhel leAraGlu61y 
4921 GGGATGACTCTAGCGGATTTTGAAGAGA6GCACTACAAGAGAATTTTCATAAGAGAGGGT 
CCCTACTGAGATCGCCTAAAACTTCTCTCCGTGAT6TTCTCTTAAAAGTATTCTCTCCCA 

ThrPheGluGl yProPheArqG InGluHisSerGI yPheVa lGlnTyrThrAl aArgGl y 

4981 acattcgaaggacccttcag2caggaacatagcgggtttgtacaatacaccgctag6g6A 
tgt aagctt cctgggaagtccgt ccttgt atcgcccaaacatgttatgtg6cgatcccct 
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CAA^GTTC^GA^TTT^^ 
"^CAAGGACTCCTTaIa^Wa^^ 

StK9 mstll, 

AACC?AG6S6TAQAAATC66?GA?CTA6^ ' sLeuLysMetG 1 nl 1 e 

^^5^ A ^^^^^ A TCTTTA6CCACTAGATCTT6TGGA^ 

5103 »vr2, 5124 Kbat , 5133 avr2, 5156 bglll, 

ttcgtgaXaaccctgaccggHaXgacc^ 1 e 

AAGCACTTTTGGGACTGGCCGTTCTGGTAGTGG6ACCTCCA^ 
5186 bstXl, 

IagaacgtgaXggccaXgatccag^ 

5251 xmnl v 
IlePheAlaGl 



5287 na*l. 5296 pvull. 



5343 sail, 



CCATCACCCG6AC66CACACGTTTTTC 



I 

ccc 



5W3 kpnl, 



tsmmsmmmmmmmsm 



5661 hpal, 

Ae6TTCCCTCGACGACATGTG6A66TTTTTTGTCCACCCCTTArtGTGTACACAAT6ACGT 
5800 ecorl, 

AGCCCTTGAGGTCGGAAGAAACTGGACTTTTTAAACTTCCCTACCAGTCCAGATGGGT^ 

AAACTCCGAAGATCACC6TCCCACCA6CC6TCTCAATTTCATCCTTTCTTACTCCTTAG6 
TTCG6GTGTTTTAATTACTCACCATA6GTTTGGCAGAGTTTTTC6TGTCG6CTAAATTGT 
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GlutletValLysLvsI 1 e ThrSerHe t AsnArqG 1 yAspPheLy sG I nl 1 eThrLeuAl a 
G AGA T GGT C A AG AAGAT A A C C AGCATGAACAGfcGGAG A CTTT AAG CAGAT AACCCTTGCA 
CTCTACCAGTrCTTCTATTGGTCGTACTTGTCCCCTCTGAAATTCGTCTATTGG^AAC6T 

rt l v< Zll* laG 1 YLysThrThrGluLeuProLysAlaVa 1 1 leGluGlu 1 1 e61 yAraHis 
A C AGGGGCAGGAAA A ACT AC AGA ACTC C CAAAGGC AGTGAT AGAGGAGAT AGGAAGACAC 
TGTCCCCGTCCTTTTTGATGTCTTGAGGGT7TCC6TCACTATCTCCTCTATCCTTCTGTG 
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Lys^rgValLeuValLeu 1 leProLeuArgAlaAlaAl aG 1 uSerVa 1 Ty rG I nTvrfle t 

AAGCGGGTGCTAGTGCTTATACCATTGAGAGCAGCAGCTGAGTCAGTCTATCAATACATG 
TT CGC CCACGAT CACGAAT ATGGTAACTCT CGT CGT CGACTCAGT CAGATA^ 

61S5 pvull, 6158 tthllll, 

*?9t5y!r^?t!i5^5 s SE lleSerPheAsnLe,|Ar 9llBGlyAspMetLysGluGlyAsp 
AGA II GAAA £ A I"CAGTATC^ 

TCTAACTTT6TAGGGTCATAGA6GAAGTTGAATTCTTATCCCCTGTACTTTCTTCCCCTG 
6210 aflll, 

l ?S5:^L aT ^E5iX lleThrT y rAl * SerT y r6l y T y rPheC ^sGlnMetProGinProLys 
AT 55£ aa S TGg6atcaccta CSCCTCATATGGATATTTTTGCCAAATGCCGCA6CCGAAG 
TACCGTT6ACCCTAGTGGATGC66A6TATACCTATAAAAACGGTTTACGGC6TCG6CTTC 

6266 ndel, 

^I?x5§SS£5St5I55^S gag ^^^^^ a ^ a ^ ata ^^^^^ gga ^ gag ^ a tcactgt6ctact 
gagtccc66c6tt accatctcataa6tat6tataaagacctactcatagtgacacgat6a 

?59^iySl5t5i!ii5^5 l lleGl Y L y sIleH i* Ar 9 Ph «ScrGluSerlleArgV*lV*l 
??I505£$5n5§£T§ T £ ATASGAAAAAT TCACA6ATTTTCTGAAAGCATAA6GGTGGTT 
GGACTCGTCAACCGACAGTATCCTTTTTAA6TGTCTAAAA6ACTTTC6TATTCCCACCAA 

$Li l ?5^I!i^ laThrProAlaGl y s «rValThrThrThrGly61nLysHi*ProXie61u 
G SI A I GA £CGCCACCCCAGCAGGGTCAGTAACTACAACAGG6CAAAAACACCCAATA6AA 

CGATACTG6CGGTGGGGTCGTCCCAGTCATTGAT6TTGTCCC6TTTTTGTG6GTTATCTT 

* 

G }V£DS^i eA1 S ProGluValMet LysGlyGluAspLeuGlySerGlnPheLeuA5pl le 
f55ITS6T65£ TCCTGAGGTGA TGAAAG6GGAAGACCTT6GAAGCCAGTTCCTTGACATA 
CTTAAGTATCGAGGACTCCACTACTTTCCCCTTCTGGAACCTTCGGTCAAG6AACTGTAT 

6481 ecorl, 6*93 mstll, 

^IiSi>ltS uL Y slleProValG1 « GluM etLysGlyAsnMetLeuVaiPheValProThr 
GCGGGGCTAAAAATCCCGGTTGAGGAGATGAAGGGTAACATGCTGGTCTTCGTACCCACA 
CGCCCCGATTTTTAGG6CCAACTCCTCTACTTCCCATTGTACGACCAGAAGCATGGGTGT 

^E9^ s 5 MetAlaValAs P ValAlal -ysLysLeuLysAlaLysGlyTyrAsnSerGlyTyr 
AGAAACATGGCAGTTGATGTAGCCAAGAAACTAAAAGCCAAGGGCTACAACTCA6GGTAT 
TCTTTGTACCGTCAACTACATCGGTTCTTTGATTTTCGGTTCCCGATGTTGAGTCCCATA 

TyrTyrSerGlyGluA^pProAl aAsnLeuArqVal Vai ThrSer61nScrProTyrVal 
TACTACAGTGGGGAAGASCCGGCTAACTTGAGGGTGGTAACATCACAGTCCCCATACGTC 
ATGATGTCACCCCTTCTGGGCCGATTGAACTCCCACCATTGTAGTGTCA66G6TATGCAG 

ValVal AlaThrAsnAlal leGluSerGlyValThrLeuProAspLeuAspThrValVa 1 
GTAGTAGCCACCAATGCCATTGAGTCAGGGGTAACGCTGCCAGATTTAGATACAGTTGTT 
CATCATCGGTGGTTACGGTAACTCAGTCCCCATTGC6ACGGTCTAAATCTATGTCAACAA 

AspThrGlyLeuLysCysGluLy«ArqValArqValSerSerLysIleProPhelleVal 
GACACAGGTCTGAAGTGTGAAAAGAGGGTGAGGGTGTCATCAAAAATACCTTTCATAGTA 
CTGT6TCCA6ACTTCACACTTTTCTCCCACTCCCACAGTAGTTTTTATGGAAAGTATCAT 

ThrGlvLeuLysArqHetAlaValThrValGlyGluGlnAlaGlnArqArqGlyArqVal 
ACAGGCCTTAAAAGAATGGCTGTCACTGTGGGCGAACAGGCTCAGCGAA6AGGCAGG6TA 
TGTCCGGAATTTTCTTACCGACAGTGACACCCGCTTGTCCGAGTCGCTTCTCC6TCCCAT 

6843 stul, 

GlyArgValLysProGl vArgTyrTyr ArqSerG 1 nG luThrA 1 aThr61 yStrLys A*p 
GGTAGAGTGAAGCCCGGTAGGTACTATAGAAGCCAGGAAACAGCGACCGGGTCAAAGGAC 
CCATCTCACTTCG6GCCATCCATGATATCTTCGGTCCTTTGTCGCT6GCCCAGTTTCCT6 

694S tthllll, 

TyrHisTyrAspLeuLeuGl nA 1 aHisArqTyrGly 1 leGluAspGl yl 1 #A»nVa i Thr 
TACCACTATGACCTGTTACAGGCACACAGGTATGGGATAGAAGATGGAATCAACGTGACA 
ATGGTGATACTGGACAATGTCCGTGTGTCCAT ACCCTATCTTCTACCTTAGTTGCACTGT 



6973 tthllll, 7017 tthllll 
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LysSerPhcA 
D21 AAGTCCTTTA 
TTCAGGAAA 

I^^5I?t?y^i^n?t?y^52 A5 2 L S uL S ul i eSer6 luAspLeuProAlaAlaValLys 
D8t ACC CA6CTGGAGAT ACTGAACAATCTACT CATCTCT6AAGACCTACCAGCAGCAGT AAAA 
TGGGTCGACCT CT ATGACTTGTTAGATGAGTAGAGACTTCTGGATGGTCGTCGTCATTTT 

7084 pvull, 

Asnl leMetAlaArgThrAspHisProGluProI 1 eGlnLeuAlaTyrAtnSerTyrGlu 
141 AA CAT CATGGCAAGGACTGATCACCCAGAACCAAT C CAGCTTGCAT ACAACAGTTATGAG 
TTGTAGTACCGTTCCTGACTAGTGGGTCTTGGTTAGGTCGAACGTATGTTGTCAATACTC 

7156 bell, 

Va lGlnVa 1 ProVa ILeuPheProLysI leArqAsnGlyGluValThrAspThrTyrGlu 
201 GT CCAGGTCCCTGTACTGTTT CCAAAAATAAGGAATGGGGAGGTTACAGATACTTACGAG 
CAGGTCCAGGGACATGACAAAGGTTTTTATTCCTTACCCCTCCAATGTCTATGAATGCTC 

AsnTyrSerPheLeuAsnAlaArgLysLeuGlyGluAspVal ProVa 1 Tyr 1 1 eTyrA I a 
261 AACTACTCATTCCTAAATGCAAGAAAACTAGGGGAAGATGTACCTGTGTACATTTATGCC 
TTGATGAGTAAG6ATTTACGTTCTTTTGATCCCCTTCTACATGGACACATGTAAATACGG 

ThrGluAspGluAspLeuA I aVa 1 AspLeuLeuGl yLeuAspTrpProAspProG I vA*n 
321 AC CGAAGATGAAGAC CTGGCAGT AGACCTTCT AGGCTTGGACTGGCCCGACCCA6GfaAAC 
TGGCTTCTACTTCT6GACCGTCATCT6GAAGATCCGAACCT6ACCGGGCTGG6TCCCTTG 

GlnGlnValValGluThrGlyLysAlaLeuLysGlnValValGlyLeuSerSerAlaGlu 
381 CAGCAAGTAGTGGAGACTGGbAAAGCACTGAAGCAAGTGGTAGGACTGTCCTCTGCTGAG 
GTCGTTCATCACCTCTGACCCTTTCGT6ACTTCGTTCACCATCCTGACAGGAGAC6ACTC 

7440 bsnl, 

AsnAlaLeuLeuI leAlaLeuPheGlyTyrValGl yTyrGl nAlaLeuSerLysArgHis 
441 AATGCCCT6CTCATA6CCCTGTTTGGGTATGTAG6ATATCAAGCTTT6TCAAAAAGACAC 
TTACGG6AC6AGTATCGGGACAAACCCAT ACATCCTATAGTTCGAAACAGTTTTTCT6TG 

7475 ecgrS, 7481 hindlll, 

ValProHetlleThrAspIleTyrThrlleGluAspGlnArgLeuGluAspThrThrHis 
'501 GTCCCAATGATCACAGACATATACACCATAGAAGATCAAAGACTAGAGGACACAACCCAC 
CAGGGTTACTAGTGTCTGTATATGT6GTATCTTCTAGTTTCTGATCTCCTGT6TTGGGTG 

7508 bell, 

LeuGlnTyrAlaProAsnAl al leArqThrGluGlyLysGluThrGluLeuLysGluLeu 
7561 CTCCAATATGCACCTAATGCTATAAGAACTGAGGGGAAGGAGACTGAACTAAAGGAATTA 
GAGGTTATACGT6GATTACGATATTCTT6ACTCCCCTTCCTCTGACTTGATTTCCTTAAT 

AlaValGlyAspMetAspArglleMetGluSerl leSerAspTyrAl aSerGl yGlyLau 
7621 GCAGTGGGTGACATGGACAGAATCAT6GAATCCATCTCAGAJTATGCATCAG6AGGGTTG 
CGTCACCCACTGTACCT6TCTTAGTACCTTAGGTAGA6TCTAATACGTAGTCCTCCCAAC 

7664 ava3, \^ 

ThrPhelleArgSerGlnAlaGluLysValArgSerAlaProAlaPheLvsGluAsny 
7681 ACATTCATAAGaTCTCAGGCAGAGAAAGTAAGaTCTGCCCCTGCATTCAAAGAAAACGT 
TGTAAGTATTCTAGAGTCCGTCTCTTTCATTCTAGACGG66ACGTAAGTTTCTTTT6CAC 

7690 bgill, 7711 bglll, 

GluAlaAlaLysGlyTyrValGlnLysPhelleAspAlaLeuU^ 
77 Al GAAGCT6CAAAAG66TACGTCCAAAAGTTTATTGATGCTCTTATT6AAAACAAAGAAACC 
CTTCGACGTTTTCCCATGCAGGTTTTCAAATAACTACGAGAATAACTTTT6TTTCTTTGG 

IlelleArgTyrGlyLeuTrpGlyThrHisThrAlaLeuTvrLvBSerlleAUM 
ATAATCAGATATGGCTTATGGGGAACACACACGGCACTTTACAAG^ 
TATTAGTCTATACCGAATACCCCTTGT6TGTGCCGTGAAATGTTCTCATAACGGCGTTCT 



7801 



* 

LeuGlyHisGluThrAlaPhBAlaThrLBuValllBLysTrpLeuAla^^ 
7861 CTGGGGCATGAAACAGCATTTGCTACGCTAGTGATAAAGTGGCTA6CCTTCGGW 

GACCCCGTACTTTGTCGTAAAC6AT6CGATCACTATTTCACCGATC66AA6CCCCCACTC 

7902 nhel , 

ProVaJSerAspHisValArGGlnAlaThrValAspLBuValValTv^^ 
7921 CCGGTGTCAGAT CATGTGAGACAGGCGACCGTTGACCTGGTCGTTT ATTa^ 

GGCCACAGTCTAGTACACTCTGTCC6CTGGCAACTGGACCA6CAAATAATACACTACTT6 

7954 tthllll. 
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LysProSerPheProG 1 yAspSerG 1 uThrG 1 nG 1 n6 i uG 1 yArq Ard'Phi.Va 1 A 1 aSer 

aaaccctctttcccaggSgaVtccgaaacccagcaggaggggagScgattcgttgccagc 

TTTGGGAGAAAGGGTCCCCTAAGGCTTTGGGTCGTCCTCCCCTCCGCTAAGCAACGGTCG 



6041 TTATTCAT CTCCGCTCTGGCAAC CT ACACATAC AAGACTTGGAACTACCACAACCTCTCC 
AATAAGTAGAGGCGAGACCGTTG6ATGTGTATGTTCTGAACCTTGATGGTGTTG6AGAGG 



6161 



B221 



8261 



LvsVal Va 161 uProA 1 aLeuAl aTvrLeuProTyrA i aThrSerAi aLeuLysMetPhe 

B101 aAggtagtagaaccagctttggcatAcctcccctacgctaccagtgcactgaaaatgttc 

TTCCATCATCTTGGT CGAAACCGTATGGAGG6GATGCGATGGTCACGTGACTTTTACAAG 
8151 xmnl , 

ThrProThrArqLeuGluSerGluVall leLeuSerThrThrl leTyrLycThrTyrLeu 

accccaactagactggagagcgaggttatacttagcactacaatatAcaXaacttXcctc 
tggggttgatctgacctctcgctccaatatgaatcgtgatgttatatgttttgaat6gag 

Serl leArgLysGlyLysSerAspGlyLeuLeuGlyThrGlylleSerAlaAlaHetGlu 
TCAATAAGSAAGGGGAAAAGTGATGGACTCTTGGGTACAGGGATTAGTGCGGCAATGGAA 
AGTTATTCCTTCCCCTTTTCACTACCTGAGAACCCATGTCCCTAATCACGCCGTTACCTT 

IleLeuSerGlnAsnProVa ISerVal Glyl leSerVa lMetLeuGlyVa 1G lyAlal le 
ATTCTGTCACAGAACCC6GTATCG6TAGGCATATCTGTTATGCTGGGGGTGGG6GCAATT 
TAAGACAGTGTCTTGGGC CATAGCCATCCGT AT AGACAATACGACCCCCACCCCCGTTAA 

8284 tthllll, 

AlaAlaHisAsnAlalleGluSerSerGluGlnLysArqThrLeuLeunetLysValPhe 

63^1 gccgctcacaatgccattgagtctagcgaacaaaJUaggaccct^ 

CGGCGAGTGTTACGGTAACTCA6ATCGCTTGTTTTTTCCTGGGACAACTACTTTCACAAG 
8391 xmnl , 

VaLLysAsnPheTrpSerGinAlaAlaThrAspGluLeuValLysGluAsnProGluLys 
8401 GTAAAAAACTTCTGGAGCCAGGCAGCAACAGATGAATTGGTGAAGGAAAATCCAGAAAAA 
CATTTTTT6AAGACCTCGGTCCGTCGTTGTCTACTTAACCACTTCCTTTTAGGTCTTTTT 

1 lei leMetAlaLeuPheGluAlaValGlnTbrlleGlyAsnProLeuArqLeuIleTyr 

8461 ataataatggccctatttgaagcagttcagacaattggtaaccctctgagScttatatat 

TATTATTACC6GGATAAACTTCGTCAAGTCTGTTAACCATTGGGAGACTCCGAATATATA 
8479 xmnl, 8497 bstE2 v 

!2i5tSSlYEf lY^2*^Y r ^Y r ^Y s ^^ t^ r E^^ uA * a ^Y 5 ®^ u ^ eu SerGiuArqThrAla 
8S21 CACCTGTATGGAGTTT ACTACAAAGGCTGGGAAGCAAAAGAACTATCCGAGAGGACAGCA 
GTGGACATACCTCAAATGATGTTTCCGACCCTTCGTTTTCTT6ATA66CTCTCCTGTCGT 

GlyArgAsnLeuPheThrLeulleMetPheGluAlaPheGluLeuLeuGlyMetAspSer 
8S81 GGt AGGAACCTGTTCACTTTGATAATGTTCGAAGCTTTC6AACTGTTA6G6AT6GACTCT 
CCGTCCTTGGACAAGTGAAACTATTACAAGCTTCGAAAGCTTGACAATCCCTACCTGAGA 

8586 xnnl, 6612 hindlll, 

GluGlyLvsl 1 eArqAsnLeuSerGlyAsnTyrl leLeuAspLeuI leTyrSerLeuHis 
8641 GAAGGGAAGATAAG6AACCTGTCTGGAAATTATATCTTGGATTTGATCTATAGTTTACAT 
CTTCCCTTCTATTCCTTGGACAGACCTTTAATATAGAACCTAAACTAGATATCAAATGTA 

LysGlnl leAsnArgSerLeuLysLysValValLeuGlyTrpAlaProAlaProPheSer 
6701 AAACAGATAAACAGAAGCTTGAAGAAAGTGGTCCTGGGGTGGGCTCCCGCACCTTTTAGT 
TTTGTCTATTTGTCTTCGAACTTCTTTCACCAGGACCCCACCCGAGGGCGTGGAAAATCA 

8715 hindlll , 



CysAspTrpThrProSerAspGluArg I leArqLeuProThr AspAsnTyrLsuArqVa 1 
8761 TGTGACTGGACTCCTAGTGATGAGAGAATTAGGTTACCCACAGACAACTATCTAAGAGTG 
ACACTGACCTGA66ATCACTACTCTCTTAATCCAATGGGTGTCTGTT6ATA6ATTCTCAC 

8792 bstE2, 

GluThrLysCysProCysGlyTyrG luMetLysA 1 aLeuArgAsnVa 1 SerGiySerLeu 

8821 gagactaagtgcccatgtggttatgagatgaaagcactaagSaacgttagtggcagtctt 
ctctgattcacgggtacaccaatactctactttcgtgattcctt6caatcaccgtcagaa 

ThrlleValGl uGluLy sGlyProPheLeuCysArdAvnArgProGl yArgGlyProVal 
8881 ACTATAGTGGAAGAGAAAGGGCCTTTTCTCTGTAGSAACAG6CCTG6TAGAGGGCCAGTT 
TGATATCACCTTCTCTTTCCCGGAAAAGAGACATCCTTGTCCGGACCATCTCCCGGTCAA 

8920 stul, 8938 hpai , 

AsnTyrArgValThrLysTyrTyrAspAspAftnLeuAl aG 1 ul 1 eLy sProVa 1 ArgArg 
6941 AACT ATAGA6TT AC AAAAT ACT ATGAT GA C AACCTCG CAGAGATAAAGCCAGTTCGAAGA 
TTGATATCTCAATGTTTT ATGATACTACTGTTGGAGCGTCTCTATTTCGGTCAAGCTTCT 
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LeuGluGlyLeuValGluHisTyrTyrLysGlyValThrAlaArglleAspTyrGlyLys 
C T AG AAGGA CT CG T GG AGC A CT ATT AC A A AGGTG TC A CAG C A AGG AT AGAT T ATGGC AAG 
GAT CTT CCTGAGCAC CT CGTGATAATGTTT CCACAGTGT CGTT CCTATCTAATACCGTT C 

GlyLysHetLeuLeuAl aThrAspLysTrpGluValGluHisGly I ieVa IThrArqLeu 
GG AA AAATG CTGTT AGC C AC T GAT A AATGGuAGGTGG AGCACGGT AT CGT AACT AGbTTG 
CCTTTTTACGACAATCGGTGACTATTTACCCTCCACCTCGTGCCATAGCATTGATCCAAC 



AlaLysLysTyrThrGI yVa 1 Gly PheLysGl yA 1 aTyrLeuG lyAspGl uProAsnHis 
7121 GCGAAGAAGTACACTGGTGT TGGGTTCAAGGGAGCATAC CTGGGTGACGAGCCCAACCAC 
CGCTTCTTCATGTGACCACAACCCAAGTTCCCTCGTATGGACCCACTGCTCGGGTT6GTG 

ArqAspleuVa 1 GluArqAspCysAl aThrl 1 eThrLysAsnThrVa I GlnPheLeuLys 
Mat CGT GACCTAGTGGAAAGAGACT&TGCAACCATAACCAAAAAT ACAGTTCAGTTTTTGAAA 
GCACTGGATCACCTTTCTCTGACACGTTGGTATTGGTTTTTATGTCAAGTCAAAAACTTT 

MetLysLysGlyCysAlaPheThrTyrAspLeuSerLeuSerAsnLeuThrArgLeuIle 
?2A1 ATGAAGAAAG6 tT&TGCATTTACCT ATGACTT6TCCCTGTCCAATTTGACCAG6TT AATT 
TACTTCTTTCCGACACGTAAATGGATACTGAACAGGGACAGGTTAAACTGGTCCAATTAA 

G luLeuValHisLysAsnAsnLeuG luGluLysAspI 1 eProA 1 aAl aThrLtuThrThr 
?301 GAATTGGTGCAC AAAAAT AAC CT TGA AGAGAAAG A C AT AC CAGCCGC CAC ATT AACAA CA 

CTTAACCACGTGTTTTT ATTGGAACTTCTCTTTCTGTAT6GTC6GC6GT6TAATTGTTGT 

CysLttuAlaTyrThrPheValAsnGluAspIleGlyThrlleLysProValLeuGlyGlu 
7361 TGCCTAGCTTACACATTTGTGAATGAA6ATATCGGGACTATAAAACCA6TACTGGGGGAG 
ACGGATCGAATGTGT AAACACTT ACTTCTATA6CCCTGATATTTTGGTCATGACCCCCTC 

9388 ecorS, 9408 seal, 

ArqVallleAlaA5pProValValAspIleAsnLeu6inProGiuValGlnValAspThr 
9421 AGAGTGATAGCCG ACCCAGTG6TAG ACATTAACTT ACAACCAGAAGTGCAGGTGGATACA 
TCTCACTATCGGCT6GGTCACCATCTGTAATTGAAT6TTGGTCTTCACGTCCACCTATGT 

SerGluValGlylleThrLeuValGlyAraAlaAlaLeunetThrThrGlylleThrPro 
?*81 TCAGAGGTTG6GATCACTCTGGTTGGAAGAGCAGCCTTGATGACAACAGGTATTACACCC 
AGTCTCCAACCCTAGTGAGACCAACCTTCTCGTCGGAACTACTGTTGTCCATAATGTGGG 

Va I Va lGluLysThrG 1 uProAsnAl aAspGl ySerProSerSerl leLysI leGlyLcu 
9541 GTGGTTGAAAAAACAGAGCCTAATGCCGATGGCAGTCCAAGCTCTATAAAGATTGGACTG 
CACCAACTTTTTTGTCTC6GATTACGGCTACCGTCAGGTTCGAGATATTTCTAACCTGAC 

AspGl uGly CysTyrProGl y ProArgProG InAspHisThrLeuAlaAspGlulleHls 
9601 GACGAAGGATGTTaCCCAGGGCCTAGaCCGCAAGACCACACTTTAGCTGACGAAATACAT 
CTGCTTCCTACAATGGGTCCCGGATCTGGCGTTCTGGTGTGAAATCGACTGCTTTATGTA 

SerArqAsp61uArgProPheVa lLeuVa ILeuGlySerArgSerSerMetSerAsnArg 
9661 TCTAGGGATGAAAGGCCCTTTGTTTTGGTCTTGGGTTCAAGaAGTTCCATGT CAAATAGA 
AGATCCCTACTTTCCGGGAAACAAAACCAGAACCCAAGTTCTTCAAGGTACAGTTTATCT 



A I aLysThrAl aArgAsnl I eAsnCysThrGlnLysArgProGlnGluI leArgAspLeu 
9721 GCAAAAACTGCTAGAAACAT CAACTGTACACAGAAAAGACCCCAGGAAATTAGA6ATCTG 
CGTTTTTGACGATCTTTGTAGTTGACAT6TGTCTTTTCTGGGGTCCTTTAATCTCTAGAC 

9774 bglll, 

MetAl aGlnGl yArgHetLeuVa I Va 1 A I aLeuArgSerPheAsnProG luLeuSerG 1 u 
9781 ATGGCACAAGGfaCGT ATGCTAGTAGTGGCTTTAAGAAGTTTCAATCCTGAGTTGTCT6AA 
TACC6TGTTCCCGCATACGATCATCACC6AAATTCTTCAAA6TTAGGACTCAACA6ACTT 

98*0 spel 9 

LeuVal AspPheLysGl yThrPheLeuAspArgVal A 1 aLeuG 1 uAl aLeuSerLeuG 1 y 
9841 CTAGTT6ATTTCAAGGGGACTTTCTTGGATAGGGTTGCCTTGGAA6CCCTTAGCCTGGGG 
GATCAACTAAAGTTCCCCTGAAAGAACCTATCCCAAC6GAACCTTCGGGAATCG6ACCCC 

9900 bgll, 

ProGlyArqProLysGl nValThrThrAlaThrVa lLysGluLeuLeuduGlnGluGlu 
9901 CCGGGAAGGCCCAAGCAGGTAACCACAGCCACAGTT AAGGA6TTGCT AGAGCAAGAGGAA 
GGCCCTTCCGGGTTCGTCCATT6GTGTC6GT6TCAATTCCTCAACGATCTC6TTCTCCTT 

9918 bstE2, 

GlnValGluIleProAsnTrpPheGlyAlaAspAspProValPh«1-;uGluValAlaL«u 
9961 CAAGTCGAGATCCCCAACTGGTTCGGTGCGGATGACCCAGTCTTCTTGGAAGTAGCTCTG 
GTTCAGCTCTAG6GGTTGACCAAGCCACGCCTACTGGGTCA6AA6AACCTTCATCGAGAC 



9994. tthllll 



Ly sG 1 yAspLys I y r H i sLeuVa 1 61 yAspVa 1 AspLy sVa ILvsAsoG 1 nAlaLvxci u 

aagggtgacaaat accactt agtaggtgatgt agataaagt aaaagatcaagcaaagcgX 
ttcccactgtttatggtgaatcatccactacatctatttcattttctagttcgtttccct 

LeuGlyAlaThrAspGlnThrArqIleValLysGluValGlyAlaArqThrTvrThpHat 
CTAGGGGCCACGGACCAAACTAGAATAGTAAAAGAAGTAGGTGCGAGAACCTACACAATS 
GATCCCCGGTGCCTGGT7T6ATCTTATCATTTTCTTCATCCACGCTCTTGGAT6TGTTAC 

LysLeuSerSerTrpPheLeuGlnAl aSerSerLysG 1 nHetSerLeuThrProLeuPhe 
AAGCTGTCTA6TT66TTTCTTCAAGCATCAAGTAAACAGATGAGCTTGACCCCTTTGTTC 
TTCGACAGATCAACCAAAGAAGTTC6TA6TTCATTTGTCTACTCGAACT6G 

GluGluLeuLeuLeuArqCysPraPraLysnetLysAsnAsnLvs61 vHislleGlvfiar 
6AGGAACTGTTGCTTCGT TGCCCTCCCAAGATGAAGAACAATAAAGGGCATA^ 
CTCCTTGACAACGAAGCAACGG6AG6GTTCTACTTCTT6TTATTTCCC6TATAGCCTA6T 

iiilYt§ In tS uAla6In5l Y AsnTp B 61uProLeuA spCy*61yV*lMi«L«uGlvThr 

6CCTACCAACTAGCTCAGGGCAACTGGGAACCCCTCGATTGT6GAGTACACCT66GCA 

C6GAT66TT6ATC6A6TCCC6TTGACCCTTGG6GA6CTAACACCTCAT6TG6ACCC6T6G 

ii?EE2ftii A !:9 Ar 9 v * 1L Y* llBHlaP « : ' oT y r61l 'Al«TyrLeuLy»L«uLy«A«oL«u 
ATACCTGCCAGGA6GGTAAAGATCCACCCATAT6A6GCCTATCTGAiACTGAAGGAtTTA 

TATGGACSGTCCTCCCATTTCTAGGTGGGTATACTCCGGATAGACTTTGACm^ 
10349 ndel, 10355 stui , 

Leu61uGluGlu61uArQLysPro61uGlyArgABpThrValIl«ArgGluHi«AcnLv« 
TTAGAAGAAGAAGAGAGGA^GCCAGAGGGTAGXGAtACAGTGATAAGAGAACAT^ 

AATCTTCTTCTTCTCTCCTTC66TCTCCCATCTCTATGTCACTATTCTCTT6TATT6TTC 

I?iJi5t5i!V^?^?y5l A E9 ProProAr 9 L y BProGlnT Yi , LysGluA«nProGlnPro 
TG6ATCCTCAAAAAA6TGAGbCCACCAAGSAAACCTCAATACAAA6AAAATCCTCAACCC 

ACCTA66A6TTTTTTCACTCC66T66TTCCTTT6GAGTTATGTTTCTTTTA66AGTT^ 
10442 bamhl, 

T66AAAGCTATCA6AGCAACTAGACTA6AGAAG6GCATAAAA6AAACATCTATAATAArr 
ACCTTTCGATAGTCTC6TTGATCTGATCTCTTCCCG^ 

LysLeuAl aSerl 1 eLeuThrGlyAl aGly 1 leAraLeuGluLvsLauPpol/alUA 1 Am 
aSaTTGGCCTCCATACTAACAGGTGCAGG^TA^^ 

tttaaccggaggtatgattgtccacgtccttattc^ 

Ala61 nThrAspHisLysSerPheHisGluAlal leAraAsoLv«Il«A«nL vcA«nfii*i 
C6GGTTTGACTG6TATTTTCAAAGGTACTCCGTTA6TCTCTATTCTATCT6TTCTTGCTT 

AsnGlnGlnSerProGlyLeuHisAspLysLeuLeuGluIlttPheHisThpl IsAlafii n 
^^^^"CAGGJtTACATGA?^ 

TTAGTCGTCTCGG6TCCTAATGTACTATTTAACAATCTCTAGAAAGTGTGTTATCGGGTT 
10716 bglll, 

^2f?^5Vt5f5t!i5T5£lY r6l X GluV * l ThrTrpGlu61nL«uGluAl«61yIlBAsn 
SSS^SSI^^S^^TTACGGtGAAGTGACGTGfcGAAC^^ 

GG6TC6GATTTCGT6TGAATGCCGCTTCACT6CACCCTTGTC6AACTCCGTCCCTAGTTG 



A ?2A5f?riX^i^i?SiX5^ Leu6luL y Bl -y«ABnLBu61y61uV«lLeuA«pSerGlu 
$55tttSo5S£ISStSS£TTT£^ agaaaa ® aagaa TCTT66AGAA6TACT6GACTCA6A6 
TCTTTTCCCCGACGTCCGAAA6ATCTTTTCTTCTTAGAACCTCTTCAT6ACCTGA6TCTC 

10811 pstl, 10821 xbal, 10845 seal, 
TTrCTrrIrrIrr$I^$SI$ A IS0S5S5l£ T S AAAA 

TTC6TG6ACCACCT6GTT6ATTA6TCTCTG6ACTTTTGTCCTGCCTTCTATTCTATAATA 

CTCTGTCGTTATGGATTCT7GCTCTTCTCCCTACA6TCACT6CTAACC6TTC6TCCCCT6 

Hf ValAs pGluLvsLvsProAraVallleGlnTyrProGluAlALysThrArgLauAl* 
A I05IIS A ! GA 5 AAGAAACCAA6AG T G ATTCAATACCCTGAAGCTAA6ACAA6XCTGGCC 
TATCAACTACTTTTCTTTGGTTCTCACTAA6TTATGGGACTTC6ATTCTGTTCT6ACCGG 

11036 ball, 

U5l^ l :^?^5i , ?5^I^S: AsnTr B v *iLyfiGinGinProValValIl«ProGlyTyrGlu 
$IS5SI6555 T T0ISI?S6 ACTG ^GTGAiGCAGCAGCCTGTTGTGATCCCA6G6TATGAA 
TAGTGATTTCAATACAT6TT6ACCCACTTCGTCGTC6GACAACACTAGGGTCCCATACTT 



Figure 2 



GtyLysThrProLeuPheLysllePheAsnLysValArgLysGluTrpAspLeuPheAsn 
?1 GGGAAGACCCCATT AT T CAAGAT CTTTAACAAGGT AAGaAAG6AATG6GACCTGTTCAAT 
CCCTT CTGGGGTAAT AAGT T CTAGAAATTGTTCCATTCTTTCCTTACCCTGGACAAGTT A 

11120 bglll , 

GluProVa 1 Al aVa 1 SerPheAspTh rLysA 1 aTrpAspThrG 1 rWa I ThrSerAra As d 
61 GAGCCAGTAGCTGTGAGTTTTGATACTAAGGCCTGGGACACCCAAGTCACTAGTAGGGAT 
CTCGGTCATCGACACTCAAAACTATGATTCCGGACCCTGTGG6TTCAGTGATCATCCCTA 

11189 stul, 11209 spel, 

LeuArgLeuI leGl yG lul 1 eGinLysTyrTyrTyrArgLysGluTrpHi*Ly«Ph«I 1» 

zi ctacggcttattggtgaaattcaaaaatattactacagGaaggagtggcacaaattcatc 

GATGCCGAATAACCACTTT AAGTTTTTATAAT6ATGTCCTTCCTCACCGTGTTTAAGTAG 
11246 sspl, 11276 cial, 

AspThrl leThrAspHisMetVaLGluValProVallleThrAlaAspGlyGluVatTyr 
SI GAT ACCATCACCG ACCAC ATGGTGGAGGT ACCAGTCATAACAGCAGATGGTGAAGTATAC 
CT ATGGT AGTGGCTGGTGT AC CACCT CCATGGTC AGT ATTGTCGTCTACCACTTCATATG 

11307 kpnl, 

1 1 eArgAsnGl yGl nArgGl ySerGl yGl nProAspThrSerA 1 aG 1 yAsnSerMetLeu 
41 ATAAGAAATGGACAAAGGGGT A6TGGCCAGCCAGACACAAGCGCAGGT AACAGCAT6CT A 
TATTCTTTACCTGTTTCCCCATCACC6GTCGGTCTGTGTTCGCGTCCATTGTCGTACGAT 

11364 ball, 11393 sphl, 

AsnValLeuThrMetMetTyrAlaPheCysGluSerThrGlyValProTyrLysScrPhe 
01 AATGTGTTAACAATGATGTATGCCTTCTGTGAAAGTACGGGfaGTTCCATATAAGAGTTTT 
TTACACAATTGTTACT ACATACGGAAGACACTTTCATGCCCCCAAGGTATATTCTCAAAA 

11406 hpal, 

AsnArgVal AlaAroI leHisVa 1 CysG lyAspAspGl yPheLeul leThrGluArqG 1 y 
61 AATAGAGTTGCAAGGATCCATGTCTGTGGGGATGACGGtTTCCTGATAACAGAGAGGGGG 
TTATCTCAAC6TTCCTA6GTACAGACACCCCTACTGCCGAAGGACTATTGTCTCTCCCCC 

11474 bamhl, 11476 bstXI, 

LeuGl yThrLvsI leCysGl nGlnArgAspAlaAsnPheCvsttetArqArqAlaSerSer 
21 CTGGGCACTAAAATTTGCCAACAAAGGGATGCAAACTTCTGCATGAGGCGG6CAAGCTCA 
GACCC6TGATTTTAAACGGTTGTTTCCCTACGTTTGAA6AC6TACTCCGCCCGTTCGA6T 

LysAsnAsnAraArqGlyLysAsnGluSerLeuProI leGlyLeuArgHisArgValLeu 
61 AAAAATAACAGAAGGGG&AAGAATGAAAGCTTGCCTATAGGTTTGAGGCATA6AGTTTTG 
TTTTTATTGTCTTCCCCTTTCTTACTTTCGAACGGATATCCAAACTCCGTATCTCAAAAC 

11607 hindlll , 

LeuProHisThrSerProArgLysCysLeuI lei leProAl aAlaThrTrpProValGl y 

.41 ctcccacacaccagtccccgtaagtgtctgataataccagcagctacatggccggtaggc 
g agggt6tgtggtcag6ggcattcac ag act att atggt cgtc6atgtacc66ccatccg 

ThrAlal lelleLeuSer^vsMetAlaAsnLyslleGlyLeuSerGlyGluArgGlyThr 
'01 ACTGCCATTATATT ATCAAAGATGGCCAACAAGATTGGATTAA6TGGAGAGAGA66TACC 
TGAC6GTAATATAATAGTTTXTACCGGTTGTTCTAACCTAATT CACCT CTCTCTCCATGG 

11723 ball, 11755 kpnl , 

ThrAlaTyrGluLysAlaVal AlaPheSerPheLeuLeuMetTyrSerTrpAsnProLeu 
761 ACGGCATATGAAAAGGCAGTGGCTTTCAGTTTCTTGTTGATGTACTCCTG6AATCCACTT 
TGCC6TATACTTTTCCGTCACC6AAAGTCAAA6AACAACTACATGAGGACCTTA6GTGAA 

11765 ndel, 

ValArgArqllsCysLeuLeuValLeuSerGlnHisProGluThrAlaProSerThrGln 

B2i gtaagSagSatttgtctcctggttctttcacagcatccagaaacagctccatcaacccag 

CATTCCTCCTAAACA6AGGACCAAGAAAGT6TCGTAGGTCTTTGTC6AG6TAGTTG66TC 

ThrSerTyrTyrTyrLy«GlyA»pProlleGlyAlaTyrLysAspVal lieGlyLy«Asn 

B81 ACCTCTTACTATTATAAAGGAGACCCAATAGGGGCCTATAAAG 

TGGAGAATGATAATATTTCCTCTGGGTTATCCCCGGATATTTCTACAATATCCTTTTTTA 

« 

LeuSerGluLeuLysArqThrGlyPheGiuLysLeuAlaAsnLeuAsnLjuSjrLjuSer 
941 CTGAGTGAACTAAAAAGGACGGGTTTTGAAAAATTGGCTAATCTAAATCTAAGCCTG 

GACTCACTT6ATTTTTCCTGCCCAAAACTTTTTAACC6ATTAGATTTAGATTCGGACAGG 



,~ 0208672 

/[ ^ J t iyurtt « - Sheet 14 ot 14 

ThrLeuG I y 1 1 eTrpSerLy sHisThrSerLysArg 1 1 el 1 eGl nA«pCv*Va IThrl 1 • 

< TGTGATCCTTAGACCAG6TTT6TATGTTCATTT6CTTATTAGGTCCT6ACACATTGGTAG 

^IXty 8 ? 1 uA *PS 1 yAsnTrpLeuVa 1 AsnAl tAspArgLeul lcSerSerLvsThrGI v 
3M 55S$aa5$55*CGGfcAATTGgcTGGTAAATGCCGASAGS^ 

CCCTTTCTCCTGCCGTTAACCGACCATTTACGGCTGTCCGACTATAGTTCGTTTTGACCG 

12102 ecorS, 12117 ball, 

,•>< t!i§i:?cI?^!i?^9^?B L y s61 X T yE ThrL « u 6»n6lyLyBHi«Tyr61u61nLBu61n 
121 ^I?IfI?^^I$SSIS5>^S6S§TTATACATTACASe65AAACACTJkTGAACAACTTCAA 
GTAGACATGTATGGACT6TTTCCAATAT6TAATGTCCCTTTT6T6ATACTT6TT6AAGTT 

12169 trnnl, 

t5li5! Diif A E9 ThrS »rProIl»M«tGlyV*lGlyThrGluAraTyrLv«L»uGl vPro 

8t II?S$SSS$$S5$SI$SSS£5ai£eiS 6s * 6TA6 65ACA6 

AAC6TCCGTTCTT6ATCGG6TTA6TACCCTCATCCCT6TCTCTCTATATTTSATCCA66A 

• ATCATTT6AACGAC6ACTCCTCCAACTTTCA66AC6AATACC6TC6ACA6CCAC6GTCG 
12284 pvull, 

SerOP 

TCAACTTTATTTACATATATAACATGTATTTAGACATAAACATATATAATATATATTTGA 
ATCAACTCTAATCATCACTATATATCAATAGATG6A6TTCATTT6TGATGT6AGTTACGT 

12*52 aatll, 12457 ncal , 

*Bi TTAACAGCCCCA 
AATTGTCG6GGT 
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Description 

Technical Field 



5 This invention relates to the field of vaccines and diagnostics for infectious diseases. Specifically, it 
relates to the disease syndrome caused by bovine diarrhea virus, and to vaccines, therapeutics, and 
diagnostics derived from the genomic sequence associated with the BDV virus. 

Background Art 

10 

Morbidity and mortality caused by bovine diarrhea virus (BDV) in dairy and beef herds is a worldwide 
unsolved economic problem. A subclinical form characterized by high morbidity and tow mortality is 
endemic and is associated with diminished respiratory capacity, neonatal diarrhea, ulcerations in the 
digestive tract, immunodeficiency, and, in calf bearing bovines, abortion and teratogenicity. The disease is 
75 recognizable in calves, but adult carriers are difficult to identify. 

An acute form of the disease results from infection of the fetus in the first trimester of pregnancy. The 
course of this form of the disease is insidious. The calves may survive the first infection, but those that do 
become immunotolerant, and excrete live viruses. They cannot survive a second infection. Since their 
capacity as carriers cannot be detected by titration of their sera, these animals are responsible for 
20 spreading of the disease from herd to herd. 

BDV also infects hog populations. In hogs, it is important to distinguish animals as being infected by 
either BDV or hog cholera virus, since hog cholera is an economically important disease, while the bovine 
diarrhea infection is of transient significance, and could, for the most part, be ignored. Hogs infected with 
cholera must be slaughtered, and since present diagnostic methods in hogs cannot distinguish between 
25 these two types of infection, hogs which are, in fact, only infected with BDV must also be destroyed. 

Present means of detection of BDV infection in calves are equally deficient, in that they rely on titration 
for antibodies in sera, which titration will fail to detect the immunotolerant calves. Thus, a diagnostic method 
is desired, but presently unavailable, which is capable both of detecting the presence of the virus in 
newborn animals with chronic infections, and in distinguishing between hog cholera virus and BDV 
30 infections. This could be accomplished either using antibodies with high affinity and specificity for the virus 
particles or using nucleic acid oligomeric probes capable of specific hybridization to the viral sequences. 

Similarly, in addition to the need for improved diagnostics, there is, at present, no effective vaccine 
which is successful in preventing the spread of the disease caused by BDV. It is, of course, desirable that 
such a vaccine would confer long-term immunity, would not infect the fetus of the inoculated animal, and 
35 would have no undesirable side effects such as induction of immunotolerance to the virus, or depression of 
the immune system. These characteristics are difficult if not impossible to acquire in an attenuated or killed 
virus vaccine. Such vaccines, for the most part, constitute the present state of the art (Saurat. P., et al, "La 
Maladie des Muqueuses" (1972) pp. 229-251, L'Expansion scientific francaise Paris). Recently, Femelius, A. 
U et al, (Am J Vet Res (1971) 32:1963-1979) have reported a vaccine prepared from a high molecular 
40 weight soluble antigen obtained by density gradient centrifugation from BDV virus grown in embryonic 
bovine kidney cells. 

The approaches used in the art for the detection of and protection against bovine viral diarrhea have 
been largely empirical and have not utilized refined knowledge of the nature of the vector causing the 
disease. The bovine diarrhea virus has, however, been classified, along with hog cholera and border 

46 disease viruses as a pestivirus which is a member of the family Togavtridae (Porterfield, J. S., "The 
Togavirions. Biology. Structure, Replication" Schlesinger. W., Ed. (1980). Academic Press, pp. 17-24). 

By analogy to other togaviruses, these viruses should contain a capsid protein and two or three 
membrane glycoproteins (Horzinek, M.C., Non-arthrpod borne Togaviruses (1981), Academic Press, London. 
Epitopes which are capable of raising antibodies associated with neutralization and protection against 

50 infection are expected to be contained in the membrane proteins (e.g., see Boere, W., et al, J Virol (1984) 
52:572-582). The pestiviruses are also characterized by soluble antigens that are approximately 80 kD 
proteins. A 76 kD protein from BDV has, in fact, been used as an experimental vaccine (Femelius, A.L., et 
al, supra). 

55 
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Disclosure of the Invention 



The invention is as described in the appended claims 1 to 21 . 

The invention provides cDNA copies of the entire bovine diarrhea virus RNA genomic sequence. This 
5 makes available the entire repertoire of peptides synthesized by the virus, and makes possible the 
preparation of proteins which contain epitopes effective and specific in generating desired antibodies and. in 
providing cells suitable for production of monoclonal antibodies. The primary structure of the genome also 
provides the necessary information to construct oligomeric sequences useful as diagnostic probes. 

The protein products are thus able to serve as vaccines to protect animals subject to infection by this 
ro virus from subsequent illness. The accessibility of the entire genome provides opportunities for production 
of effective proteins, such as major virion components and individual virion subunits which would be 
unavailable using "native" production techniques, i.e., from viral infection of tissue cultured cells. 

Accordingly, in one aspect, the invention relates to a nucleotide sequence substantially identical with 
that representing the entire genome of BDV as shown in Figure 2. Other aspects of the invention concern 
75 DNA or RNA sequences derived from portions of the genome, said sequences not necessarily representing 
contiguous portions. These are useful both as diagnostic probes and as coding sequences for desired 
proteins. 

Other aspects of the invention include expression systems for the foregoing DNA derived from BDV 
which are effective in expressing this DNA in suitable heterologous hosts, including procaryotes. yeast and 

20 mammalian cells. Live viral vectors, such as vaccinia , can also be used as carriers, and permit expression 
of the desired antigens along with the carriers' proteins in infected cells. Also included in the invention are 
hosts transformed with these expression systems and the proteins thus produced. The proteins produced in 
this way, or chemically synthesized to correspond to the deduced sequence, may be used as vaccines 
either alone, or in conjunction with carrier proteins which enhance their immunogenictty. In addition, the 

25 proteins may be used, either alone or conjugated with carrier, to elicit production of antibodies which are 
useful in diagnosis of carriers of the disease or in other immunoassays related to BDV. 

The invention also relates to methods for preparing these polypeptide vaccines and immunoglobulins, 
and to methods of using the materials thus prepared. 

30 Brief Description of the Drawings 

Figure 1 shows the map of overlapping segments of cDNA which, together, make up the entire BDV 
genomic sequence and cDNA fragments used to construct E. coli expression vectors. 

Figure 2 shows the complete nucleotide sequence for the BDV genome. The cDNA contains the 
35 identical sequence, except, of course, that T will be substituted for U. The deduced amino acid sequence, 
based on the open reading frame, and confirmed by expression of segments is also shown. 

Modes of Carrying Out the Invention 

40 A. Definitions 

As used herein, a nucleotide sequence "substantially identical" to the exemplified BDV genome refers 
to a sequence which retains the essential properties of the exemplified polynucleotide. A specific, but non- 
limiting example of such substantial equivalence would be represented by a sequence which encodes the 

46 identical or substantially identical amino acid sequence, but, which, because of codon degeneracy, utilizes 
different specific codons. Nucleotide changes are, indeed, often desirable to create or delete restriction 
sites, provide processing sites, or to alter the amino acid sequence in ways which do not adversely affect 
functionality. "Nucleotide sequence" refers both to a ribonucleotide and a deoxy ribonucleotide sequence 
and includes the positive sense strand, as shown, and the negative sense strand as well. 

so A DNA sequence "derived from" the nucleotide sequence which comprises the genome of BDV refers 
to a DNA sequence which, is comprised of a region of the genomic nucleotide sequence, or a combination 
of regions of that sequence. These regions are, of course, not necessarily physically derived from the 
nucleotide sequence of the gene, but refer to polynucleotides generated in whatever manner which have the 
same or "substantially identical" sequence of bases as that in the region(s) from which the polynucleotide is 

55 derived. For example, typical DNA sequences "derived from" the BDV genome include fragments encoding 
specific epitopes, fragments encoding portions of the viral polypeptide, sequences encoding the capsid 
proteins, sequences encoding deleted virions, and sequences encoding other useful viral genes. 
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"Recombinant host cells", "host cells", "cells", "cell lines", "cell cultures", and other such terms 
denoting microorganisms or higher eucaryotic cell lines cultured as unicellular entities, are used inter- 
changeably, and refer to cells which can be, or have been, used as recipients for recombinant vector or 
other transfer DNA, and include the progeny of the original cell transfected. It is understood that the 

s progeny of a single parental cell may not necessarily be completely identical in morphology or in genomic 
or total DNA complement as the original parent, due to accidental or deliberate mutation. Progeny of the 
parental cell which are sufficiently similar to the parent to be characterized by the relevant property, such as 
the presence of a nucleotide sequence encoding a desired peptide, are included in the progeny intended 
by this definition, and are covered by the above terms. 

io "Control sequence" refers to DNA sequences which are necessary to effect the expression of coding 
sequences to which they are ligated. The nature of such control sequences differs depending on the host 
organism: in procaryotes, generally such control sequences include a regulatory region promoter and 
ribosome binding site and termination signals; in eucaryotes, generally, such control sequences include 
promoters, terminators, and, in some instances, transcriptional enhancers. The term "control sequences" is 

15 intended to include, at a minimum, all components whose presence is necessary for expression, and may 
also include additional components whose presence is advantageous. 

"Operably linked" refers to a juxtaposition wherein the components so described are in a relationship 
permitting them to function in their intended manner. A control sequence "operably linked" to a coding 
sequence is ligated in such a way that expression of the coding sequence is achieved under conditions 

20 compatible with the control sequences. 

B. General Description 

At the center of the present invention is the provision of a nucleotide sequence containing the entire 
25 genome of bovine diarrhea virus* The availability of this complete polynucleotide permits the design and 
production of oligomeric probes for diagnosis, of vaccines effective against BDV, and of proteins useful in 
production of neutralizing antibodies. Sequencing information available from the genome allows the amino 
acid sequence of the polypeptide to be deduced, and locations of favorable epitopes surmised. Further, 
once the desired sequences are chosen, appropriate fragments of the genome can be obtained and 
30 expressed independently, thus providing desired polypeptides. Short polypeptide fragments may also be 
chemically synthesized and linked to carrier proteins for use as immunogens. Recombinants expressed 
polypeptides may be provided under conditions offering a favorable environment for processing into, for 
example, conjugation with cellular or artificial membranes which could thus bear the epitopic sites without 
the disadvantages of using an infectious virus. Mammalian and yeast cells provide suitable environments for 
35 such expression. In addition, the epitopes may be produced linked to a particle forming protein. 

The above proteins produced may, themselves be used as vaccines, or may be used to induce 
immunocompetent B cells in hosts, which B cells can then be used to produce hybridomas that secrete 
antibodies useful in passive immunotherapy and diagnosis. 

40 B.1. Nucleotide Sequence of the BDV Genome 

The genomic sequence of BDV was obtained from cDNA clones representing overlapping sections of 
the entire viral RNA genome (Figure 1). The viral RNA was isolated from virus grown on bovine embryonic 
kidney cells. The viral RNA was fractionated on sucrose gradients, and those fractions containing RNA of 

45 sufficient length to contain the intact genome were pooled, ethanol precipitated, and used to prepare a 
cDNA library. cDNA inserts were screened initially using a (+/-) system. Positive hybridizations were 
against RNA isolated from virus after lysis of infected ceils, negative hybridizations were against RNA 
isolated from uninfected cells. One insert having the proper + /-response was then used as a reference 
clone to map the remainder of the library. Several colonies hybridizing to the positive insert were used to 

so obtain additional portions of the viral genome from the cDNA library using "walking" techniques. Ten cDNA 
clones were obtained representing overlapping portions of the viral genome, as shown in Figure 1 . and were 
subjected to restriction mapping and sequencing. The entire genomic sequence was deduced from these 
ten cDNA inserts, and is shown in Figure 2. 

The illustrated DNA sequence and portions thereof are useful directly as diagnostic tools for detecting 

55 the presence of BDV in infected animals. These are particularly useful in distinguishing BDV infections from 
hog cholera virus. Methods to employ DNA hybridization in diagnosing disease have been disclosed in U.S. 
Patent No. 4,358,535 to Falkow. As set forth therein, biological samples may be used directly in obtaining 
Southern blots using suitable probes. Since the BDV genome is different from that of hog cholera virus. 
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specific portions of the BDV sequence may be used to detect the presence of corresponding complemen- 
tary sequences in biological samples from subjects suspected of harboring the infection. 

B.2. Preparation of Viral Polypeptide Fragments in E. coli 

5 

The availability of the entire genomic sequence permits construction of expression vectors encoding 
presumptively antigenically active regions of the virion proteins. Fragments encoding the desired proteins 
are obtained from the cDNA clones using conventional restriction digestion and ligated into a series of 
vectors containing polylinker sites in all possible reading frames to generate fusion proteins at the C- 
ro terminal end of 0-gaJactosidase. Eleven portions of the BDV genome were expressed as 0-gal fusions in E. 
coli using this approach, as outlined in Figure 1. These portions were obtained by restriction cleavage 
and/or ligation of the ten original clones, or the original cloned sequences were used directly. The fusion 
proteins thus produced may be immunogenic. 

is B.3. Preparation of Antigenic Polypeptides and Conjugation with Carrier 

Peptide regions representing epitopes can be synthesized using chemical or recombinant methods, and 
provided with, for example, cysteine residues at the C-terminus which provide means for linking the 
peptides to neutral carrier proteins. A number of techniques for obtaining such linkage are known in the art, 

20 including the formation of disulfide linkages using common reagents such as N-succinimidyl-3-(2-pyridyl- 
dithio)propionate (SPDP) and succinimidyM^r^aleimido-methyl)cyclohexane-lK^tx>xylate($MCC) ob- 
tained from Pierce Company. Rockford, Illinois. These reagents create a disulfide linkage between 
themselves and peptide cysteine residues in one protein and an amide linkage through the c-amino on a 
lysine, or other free amino group in the other. A variety of such disulfide/amide-forming agents are known. 

25 See, for example. Immun Rev (1982) 62:185. Other bifunctional coupling agents form a thioether rather than 
a disulfide linkage. Many of these thioether-forming agents are commercially available and include reactive 
esters of 6-maleimidocaproic acid, 2-bromoacetic acid, 2-iodoacetic acid, 4-(N-maleimido-methyi) 
cyclohexane-1-carboxylic acid, and the like. The carboxyl groups can be activated by combining them with 
succinimide or 1-hydroxy-2-nitro-4-sulfonic acid, sodium salt The foregoing list is not meant to be 

do exhaustive, and modifications of the named compounds can clearly be used. 

Any carrier may be used which does not itself induce the production of antibodies harmful to the 
subject, such as the various serum albumins, tetanus toxoids, or keyhole limpet hemocyanin (KLH). 

The conjugates, when injected into suitable subjects, result in the production of antisera which contain 
immunoglobulins specifically reactive against not only these conjugates, but also against fusion proteins 

35 carrying the analogous portions of the sequence, and against whole BDV. 

B.4. Preparation of Mammalian Cell Membranes Containing BDV Epitopes 

Portions of the cDNA library comprising the BDV genome were also ligated into expression vectors 
40 compatible with mammalian recombinant host cells; in the illustration below, into a mammalian/bacterial 
shuttle vector containing a linker sequence downstream of the SV40 early promoter, which is followed by 
the polyA sequence also derived from SV40. Alternate vectors to this particular host vector, pSV7d, could, 
of course, also be used. The mammalian-compatible vectors containing the coding sequences for the 
desired polypeptides are then transformed into suitable mammalian cells for expression of the sequences 
45 and, in the case of surface glycoproteins, transport of the produced protein to the membrane. The cells are 
ultimately harvested and used as whole cells in the formulation of vaccines, or the membranes are 
disrupted and portions of the membranes used correspondingly, or the proteins purified and formulated into 
vaccines. 

so B.5. Preparation of Hybrid Particle Immunogens Containing BDV Epitopes 

The immunogenicity of the epitopes of BDV may also be enhanced by preparing them in mammalian or 
yeast systems fused with particle-forming proteins such as that associated with hepatitis B virus (HBV) 
surface antigen (HBsAg). Constructs wherein a BDV epitope is linked directly to the particle-forming protein 
55 coding sequences produce hybrids which are immunogenic with respect to the BDV epitope, as welt as to 
HBV epitopes. 

Hepatitis B surface antigen has been shown to be formed and assembled in S. cerevisiae (Valenzuela 
et al, Nature (1982) 298:344-350. The formation of such particles has been shown to enhance the 
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immunogenicity of the monomer subunit. The particles can also be formed from constructs which contain 
the presurface (pre-S) region in addition to the mature surface antigen. The pre-S region encodes an 
immunodominant HBV epitope and these proteins are expressed in yeast (Neurath et al, Science (1984) 
224:392-394). Expression of constructs encoding pre-S region fused to particle forming protein are 

5 disclosed in European Patent Application 0 174 444. Expression of coding sequences for hybrid particles 
containing HBsAg and a heterologous epitope are disclosed in U.S. 4,722,840. These constructs may also 
be expressed in mammalian cells such as Chinese hamster ovary cells using an SV40-dihydrofolate 
reductase vector (Michelle et al, Int Symp on Viral Hepatitis (1984)). 

In addition, portions of the particle-forming protein coding sequence per se may be replaced with 

w codons for an BDV epitope. In this replacement, regions which are not required to mediate the aggregation 
of units to form immunogenic particles in yeast or mammals can be deleted, thus eliminating additional 
hepatitis B antigenic sites from competition with the BDV epitope. 

B.6. Vaccinia Carrier 

Large, wide host range virus carriers have also been used in formulating vaccines by integrating the 
epitopic regions of the desired immunogen into the carrier viral genome. Vaccinia virus, in particular, has 
been used for this purpose. For example. Smith, G.L, et ai, Proc Natl Acad Sci (USA) (1983) 80:7155-7159, 
disclose the integration of the hemagglutinin gene from influenza virus into the vaccinia genome and use of 

20 the resulting recombinant virus as a vaccine. Similarly, Panicali, D., et al, ibid (1982) 79:4927-4931, cloned 
the thymidine kinase gene from Herpes simplex virus into vaccinia The availability ofthe BDV genome of 
the invention offers similar opportunities. The recombination is generally done by co-infecting cells both 
with vaccinia virus and with a chimeric plasmid carrying the desired coding sequence under the control of 
the transcriptional regulatory signals and RNA start site from the vaccinia virus gene adjacent to a 

25 translation^ start site/foreign protein coding sequence. During infection the similarity in the flanking DNA 
sequences of the foreign DNA sequences to those in vaccinia causes integration of the desired portion of 
the chimeric plasmid into the vaccinia genome. The resulting recombinant vaccinia can be harvested from 
the infected cells and used in the formulation of a vaccine. Vaccinia virus has an extremely large (120 x 10 s 
dalton) genome, and may be very easily grown in culture. Hence, the production of large amounts of 

30 inexpensive immunogenic vaccine is readily possible. , ~v 

B.7. Preparation of Vaccines 

Preparation of vaccines which contain peptide sequences as active ingredients is also well understood 

35 in the art Typically, such vaccines are prepared as injectables, either as liquid solutions or suspensions; 
solid forms suitable for solution in, or suspension in, liquid prior to injection may also be prepared. The 
preparation may also be emulsified or the protein encapsulated in liposomes. The active immunogenic 
ingredient is often mixed with excipients which are pharmaceutical^ acceptable and compatible with the 
active ingredient. Suitable excipients are, for example, water, saline, dextrose, glycerol, ethanol. or the like 

40 and combinations thereof. In addition, if desired, the vaccine may contain minor amounts of auxiliary 
substances such as wetting or emulsifying agents, pH buffering agents, or adjuvants which enhance the 
effectiveness of the vaccine. The vaccines are conventionally administered parenterally, by injection, for 
example, either subcutaneously or intramuscularly. Additional formulations which are suitable for other 
modes of administration include suppositories and, in some cases, oral formulations. For suppositories, 

45 traditional binders and carriers may include, for example, polyalkaline glycols or triglycerides; such 
suppositories may be formed from mixtures containing the active ingredient in the range of 0.5% to 10%, 
preferably 1%>2%. Oral formulations include such normally employed excipitents as, for example, phar- 
maceutical grades of manitol, lactose, starch magnesium stearate, sodium saccharine, cellulose, magne- 
sium carbonate and the like. These compositions take the form of solutions, suspensions, tablets, pills. 

so capsules, sustained release formulations or powders and contain 10%-95% of active ingredient, preferably 
25%-70%. 

The proteins may be formulated into the vaccine as neutral or salt forms. Pharmaceuttcally acceptable 
salts, include the acid addition salts (formed with the free amino groups of the peptide) and which are 
formed with inorganic acids such as, for example, hydrochloric or phosphoric acids, or such organic acids 
55 as acetic, oxalic, tartaric, mandelic, and the like. Salts formed with the free carboxyl groups may also be 
derived from inorganic bases such as, for example, sodium, potassium, ammonium, calcium, or ferric 
hydroxides, and such organic bases as isopropylamine, trimethylamine, 2-ethylamino ethanol, histidine, 
procaine, and the like. 
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The vaccines are administered in a manner compatible with the dosage formulation, and in such 
amount as will be therapeutically effective and immunogenic. The quantity to be administered depends on 
the subject to be treated, capacity of the subject's immune system to synthesize antibodies, and the degree 
of protection desired. Precise amounts of active ingredient required to be administered depend on the 
5 judgment of the practitioner and are peculiar to each subject. 

6.8. Preparation of Mabs Against BDV Epitopes 



The immunogenic proteins or immunoconjugates prepared as described above may be used to obtain 
io peripheral blood lymphocytes and spleen cells in injected mammals to prepare hybridomas capable of 
secreting monoclonal antibodies directed against these epitopes. The resulting monoclonal antibodies are 
particularly useful in diagnosis, and, those which are neutralizing are useful in passive immunotherapy. 

C. General Methods 



The general techniques used in extracting RNA from the virus, preparing and probing a cDNA library, 
sequencing clones, constructing expression vectors, transforming cells, and the like are known in the art 
and laboratory manuals are available describing these techniques. However, as a general guide, the 
following sets forth some sources currently available for such procedures, and for materials useful in 
20 carrying them out 

C.1 . Hosts and Expression Control Sequences 

Both procaryotic and eucaryotic host cells may be used for expression of desired coding sequences 

26 when appropriate control sequences are used compatible with the designated host. Procaryotes are more 
useful for cloning; either procaryotes or eucaryotes may be used for expression. Among procaryotic hosts, 
E. coli is most frequently used, mostly for convenience. Expression control sequences for procaryotes 
include promoters, optionally containing operator portions, and ribosome binding sites. Transfer vectors 
compatible with procaryotic hosts are commonly derived from, for example, pBR322, a plasmid containing 

30 operons conferring ampicillin and tetracycline resistance, and the various pUC vectors, which also contain 
sequences conferring antibiotic resistance. The foregoing operons may be used as markers to obtain 
successful transform ants by selection. Commonly used procaryotic control sequences include the 0 
lactamase (penicillinase) and lactose promoter systems (Chang, et al, Nature (1977) 198:1056, the 
tryptophan (trp) promoter system (Goeddel, et al. Nucleic Acids Res (1980) 8:4057) and theTderived P L 

35 promoter and N gene ribosome binding site (Shimatake. et al, Nature (1981) 292:128). The foregoing 
systems are particularly compatible with E. coli ; if desired other procaryotic hosts such as strains of Bacillus 
or Pseudomonas may be used, with corresponding control sequences. 

Eucaryotic hosts include yeast and mammalian cell culture. Saccharomyces cerevisciae, or Baker's 
yeast and Saccharomyces carlsbergensis are the most commonly used yeast hosts, again because of 

40 convenience. Yeast compatible vectors carry markers which permit selection of successful transform ants by 
conferring prototrophy to auxotrophic mutants or by conferring antibiotic resistance or resistance to heavy 
metals on wild-type strains. Yeast compatible vectors may employ the 2 micron origin of replication 
(Broach, J., et al, Meth Enz (1983) 101:307) the combination of CEN3 and ARS1, or other means for 
assuring replication, such as sequences which will result in incorporation of the appropriate fragment into 

46 the host cell genome. Control sequences for yeast vectors include promoters for the synthesis for glycolytic 
enzymes (Hess, et al, J Adv Enzyme Reg (1968) 7:149, Holland, et al. Biochemistry (1978) 17:4900), and 
the promoter for 3 phosphoglycerate kinase (Hitzeman, et al, J Biol Chem (1980) 255:2073). For yeast 
expression, terminators may also be included, such as those derived from the enolase~gehe (Holland, M. J., 
J Biol Chem (1981) 256:1385). Particularly useful control systems include those specifically described 

so herein, which comprise the glyceraldehyde-3 phosphate dehydrogenase (GAPDH) promoter or alcohol 
dehydrogenase (ADH) regulatable promoter, terminators also derived from GAPDH, and, if secretion is 
desired, leader sequence from yeast alpha factor. These systems are described in detail in U.S. 4,876,197 
and US 4,870,008 

Mammalian cell lines available as hosts for expression include many immortalized cell lines available 
55 from the American Type Culture Collection, including HoLa cells, Chinese hamster ovary (CHO) cells, baby 
hamster kidney (BHK) cells, and a number of other cell lines. Suitable promoters for mammalian cells 
prominently include viral promoters such as that from Simian virus 40 (SV40) (Fiers, et al, Nature (1978) 
273:113) or other viral promoters such as the Rous sarcoma virus (RSV) adenovirus, and bovine papiloma 
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virus (BPV). Mammalian cells may also require terminator sequences. Vectors suitable for replication in 
mammalian ceils may include viral replicons, or sequences which insure integration of the appropriate 
sequences into the host genome. 

5 C.2. Transformations 

The transformation procedure used depends on the host to be transformed. Bacterial transformation 
generally employs treatment with calcium or rubidium chloride (Cohen, S. N., Proc Natl Acad Sci (USA) 
(1972) 69:2110, Maniatis. et al, Molecular Cloning: A Laboratory Manual (1982) Cold Spring Harbor Press, p. 
10 254). Yeast transformations may be carried out using the method of Hinnen, A., et aJ, Proc Natl Acad Sci - 
(USA) (1978) 75:1929-1933. Mammalian transformations are conducted using the calcium phosphate 
precipitation method of Graham and van der Eb, Virology (1978) 52:546, or the various modifications 
thereof. 

is C.3. Vector Construction 

Vector construction employs techniques which are by now quite well understood. Site-specific DNA 
cleavage is performed by treating with suitable restriction enzyme under conditions which generally are 
specified by the manufacturer of these commercially available enzymes (see. e.g., The New England 

20 Biolabs Product Catalog). In general, about 1 ixg of plasmid or DNA sequence is cleaved by 1 unit enzyme 
in about 20 ul buffer solution for an incubation time of about 1*2 hr at about 37* C. After incubation with the 
restriction enzyme, protein is removed by phenol/chloroform extraction and the DNA recovered by 
reprecipitation with ethanol. The cleaved fragments may be separated using polyacrylamide or agarose gel 
electrophoresis techniques, according to the general procedures found in Methods in Enzymology (1980) 

25 65:499-560. 

Sticky ended cleavage fragments may be blunt ended using E. coli DNA polymerase I (Klenow) in the 
presence of the appropriate deoxynucleotide triphosphates (dNTPs) using incubation conditions appropriate 
to the polymerase. The polymerase digests protruding 3* single strands, but fills in 5' protruding ends, 
according to the dNTPs present in the mixture. Treatment with S1 nuclease may also be used, as this 
30 results in hydrolysis of any single stranded DNA portion. 

Ligations are carried out using standard buffer and temperature conditions using T4 DNA ligase, and 
ATP; sticky end ligations require less ATP and less ligase than blunt end ligations. When vector fragments 
are used as part of a ligation mixture, the vector fragment is often treated with bacterial alkaline 
phosphatase (BAP) in order to remove the 5' phosphate and thus prevent religation of the vector; 
35 alternatively, restriction enzyme digestion of unwanted fragments can be used to prevent religation. 

Ligation mixtures are transformed into suitable cloning hosts, such as E. coli , and successful transfor- 
mants selected by, for example, antibiotic resistance, and screened for the correct construction. 

C.4. Construction of Desired DNA Sequences 

40 

Synthetic oligonucleotides may be prepared using an automated oligonucleotide synthesizer as de- . 
scribed by Warner. B. D., et al, DNA (1984) 3:401-411. If desired, these synthetic strands may be kinased 
for labeling with ^P by using an excess of polynucleotide kinase in the presence of labeled ATP, under 
standard kinasing conditions. 

46 DNA sequences including those isolated from genomic or cDNA libraries may be modified by site 
directed mutagenesis, as described by Zoller. M, et al, Nucleic Acids Res (1982) 10:6487-6499. Briefly, the 
DNA to be modified is packaged into phage as a single stranded sequence, and converted to a double 
stranded DNA with DNA polymerase using, as a primer, a synthetic oligonucleotide complementary to the 
portion of the DNA to be modified, and having the desired modification included in its own sequence. The 

so resulting double stranded DNA is transformed into a phage supporting host bacterium, and cultures of the 
transformed bacteria, which will contain replications of each strand of the phage, are plated in agar to obtain 
plaques. Theoretically 50% of the new plaques will contain phage having as a single strand the mutated 
form; 50% will have the original sequence. Replicates of the plaques are hybridized to kinased synthetic 
probe at temperatures and conditions which permit hybridization with the correct strand, but not with the 

55 unmodified sequence. The thus identified, desired, modified sequences are then recovered and cloned to 
serve as sources for the desired DNA. 
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C.5. Hybridization with Probe 



DNA libraries are probed using the procedure of Grunstein and Hogness (Proc Natl Acad Sci (USA) 
(1975) 73:3961). Briefly, in this procedure, the DNA to be probed is immobilized on nitrocellulose filters, 

5 denatured, and prehybridized with a buffer containing 0-50% formamide, 0.6 M NaCI, 60 mM sodium 
citrate, 0.02% (wt/V) each of bovine serum albumin, polyvinyl pyrollidine, and Ficoll, 50 mM sodium 
phosphate (pH 6.5), 1% glycine, and 100 ug/ml carrier denatured DNA. The percentage of formamide in the 
buffer, as well as the time and temperature conditions of the prehybridization and subsequent hybridization 
steps depends on the stringency desired. Oligomeric probes which require lower stringency conditions are 

10 generally used with low percentages of formamide, lower temperatures, and longer hybridization times. 
Probes containing more than 30 or 40 nucleotides such as those derived from cDNA or genomic sequences 
generally employ higher temperatures, e.g. about 40-42* and a high percentage, e.g. 50% formamide. 
Following prehybridization, this same buffer, now containing the K P kinased oligonucleotide probe, is added 
to obtain hybridization. Radioautography of the treated filters shows the location of the hybridized probe. 

75 and the corresponding locations on replica filters which have not been probed can then be used as the 
source of the desired DNA. 

C. 6. Verification of Construction and Sequencing 

20 For routine vector constructions, ligation mixtures are transformed into E. coli strain HB101 or other 
suitable host, and successful transform ants selected by antibiotic resistance or other markers. Plasmids 
from the transformants are then prepared according to the method of Clewell, D. B„ et al, Proc Natl Acad 
Sci (USA) (1969) 62:1159. usually following chloramphenicol amplification (Clewell, D. B. t J Bacteriol (1972) 
110:667). The isolated DNA is isolated and analyzed by restriction analysis, or sequenced by the dideoxy 

25 method of Sanger, F., et al, Proc Natl Acad Sci (USA) (1977) 74:5463. as further described by Messing, et 
al, Nucleic Acids Res (1981) 9:309, or by the method of Maxam, et al. Methods in Enzymology (1980) 
65:499. ~ 

D. Examples 

30 j 

The following examples are intended to illustrate but not limit the invention. The procedures set forth, 
for example, in Us D.1 and D.2 may, if desired, be repeated but need not be, as techniques are available for 
construction of the desired nucleotide sequences based on the information provided by the invention. 
Expression is exemplified in E. coli and in yeast, however other systems are available as set forth more 
35 fully in IC.1. Additional epitopes derived from the genomic structure may also be produced, and used to 
generate antibodies as set forth below. 

D.1. Preparation of cDNA 

40 D.1. a. Production of BVD Virus 

Bovine Embryonic Kidney cells (BEKI) ceils were grown in MEM (Earl's) containing 0.85 g/l NaHC03 
and 10% of irradiated fetal calf serum. The biologically cloned Osloss strain of BVD virus was passaged 5 
times through BEKI cells at a multiplicity of 0.1. Cytopathic effects, consisting of clustering of cells followed 

45 by vacuolation and then cell lysis, were readily observable from the first passage. Final titers (- 10 8 pfu/ml) 
were obtained after recovery of virus by freezing and thawing of infected cells. 

For the virus production, 175 cm 2 plastic flasks of subconfluent BEKI cells were used. The cells were 
washed 3 times with infection buffer (MEM (Earl's) + 2.2 g/l NaHCOa. pH 7.6) and then were infected with 
2 ml of BVD in infection buffer at a multiplicity of 0.05 pfu/cell. After 1 hr at 35* C, 18 ml of infection buffer 

so was added and the cells were incubated for 4-5 days at 35 *C, after which cytopathic effect (vacuolation 
followed by cells lysis) was greater than 80%. tn a typical production. 150 flasks of cells were infected. The 
medium (about 3 liters) was collected and stored at 4*C. The remaining cells were scraped in 2 ml of 
infection buffer/Mask, subjected to 3 cycles of freezing and thawing, and the final suspension was added to 
the infection medium. After a centrifugation at 10.000 g for 30 min. the supernatant was concentrated 10- 

55 fold by ultracentrifugation at 120.000 g for 4 hrs and 40 min at 4* C. 

Infectious virus had a density of 1.12 g/ml as measured by isopicnic banding in sucrose density 
gradient, and appeared as 45-55 nm spherical particles by electron microscopy. The virus preparations 
were neutralized by anti-BVD antiserum from rabbits injected with virus or from bovines. 
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D.1.b. Extraction and Purification of Viral RNA 

RNA was isolated from the virus pellet by the CsCI/guanidinium thiocyanate method as described by 
Chirgwin. et al. Biochemistry (1979) 18:3294, and the purified RNA stored in 70% ethanol at -20 'C. This 

s RNA preparation contained a large amount of contaminating low molecular weight cellular RNA and intact 
viral RNA. Viral RNA was further purified by sucrose density gradient centrifugation as follows: 

An aliquot containing an estimated amount of 5 ug of BVD-RNA was centrifuged at 10,000 g for 15 min 
at 4*C. The pellet was washed with 80% ethanol, denatured in 375 ul of 99% DMSO (99%). 5 mM Tris-HCI 
(pH 7.5) and incubated for 5 min at 37 *C. After addition of 1.125 ml of 5 mM Tris HCI (pH 7.5). 1mM 

w EDTA, 1% Sarkosyl. the solution was heated for 2 min at 70 *C and quenched on ice. This solution was 
distributed on 5x15-30% sucrose gradients in 5 mM Tris HCI (pH 7.5), 10 mM EDTA, 0.1 M NaCI, 1% 
Sarkosyl (in sterile siliconized Beckman SW40 tubes). A sixth gradient was loaded with 3* end labeled RNA 
as a marker (see below). After a centrifugation for 16 hrs at 19,000 rpm (20 # C). the gradients were 
fractionated (1 ml fractions). The RNA from each fraction of the gradient corresponding to that containing 

75 marker-labeled RNA was precipitated with 2.5 volumes of ethanol in the presence of carrier yeast RNA (10 
ug) and subjected to formaldehyde agarose gel electrophoresis, Lehrach, et al, Biochemistry (1977) 
16:4743, to determine which fraction contained the BDV-RNA band. Fractions corresponding to those 
containing the BDV-RNA, were pooled from the parallel gradients and precipitated with 2.5 volumes of 
ethanol, washed with 80% ethanol and stored at -20" C in 70% ethanol. 

20 The purified viral RNA was labeled with ^P-pCp (3000 Ci/n mol) according to England, et al, Meth 
Enzymol (1980) 65:65-74, and analyzed by agarose gel electrophoresis in the presence of 25~~M 
formaldehyde as described in Lehrach, et al, (supra). Fluorography was done with 3 H-Enhancer (NEN) as 
recommended by the manufacturer. 

The majority of the radioactivity was associated with low molecular weight RNA (less than 2 kb), but a 

25 small proportion was found in a high molecular band approximately 12.5 kb, identified as RNA by labeling 
properties with RNA ligase, its sensitivity to RNAse and alkali, and resistance to DNAse and proteinase K. In 
agreement with other reports on togaviruses of the flavivirus group, the BDV-RNA did not bind to oligo dT 
cellulose, showing either the absence of a polyA stretch at the 3' end, or that, if present, the polyA is 
extremely short. Control Sindbis virus RNA was properly retained by the same column. 

30 These properties of the 12.5 kb band were identical with those shown by RNA extracted from BEKI 
cells, grown as follows: 

BEKI cells were grown in 25 cm 2 plastic flasks, washed 3 times with infection buffer, and infected at 
multiplicities of 50-100 pfu/ceil with 1 ml of BDV solution. After one hour at 35* C ( 4 ml of infection buffer 
was added and the incubation was continued. After 12, 15, 18. 21 and 36 hrs (36 hr corresponds to a 

35 complete cycle of BDV replication), the newly synthesized RNA was labeled with 3 H-uridine (100 uCi/dish). 
Uninfected cellular RNA harvested after 18 hrs of incubation was also analyzed. After 30 min of labeling, the 
cellular RNA was extracted using the CsCI/guanidinium thiocyanate method of Chirgwin et al. 1979 (supra). 
The pellet of RNA, obtained after ultracentrifugation through a 5.7 M CsCI cushion, was directly analyzed by 
formaldehyde agarose gel electrophoresis and gel was dried and fluorographed. In all the incubation times 

40 tested, a 12.5 kb band which is absent in the uninfected cells could be detected which has the same 
physico-chemical properties as shown by the RNA above. 

D.1.c. Preparation of cDNA 

45 The viral RNA isolated from the virus in tD.l.b. was polyadenylated using the method of Sippel, Eur J 
Biochem (1973), 37:31-40. Briefly, the estimated amount of 0.7 ug of purified BVD RNA was incubatedlnS 
ml of 5 mM methylmercury hydroxide for 10 min at room temperature and incubated for 6 min at 37* C with 
20 units of polyA polymerase (BRL) and 500 uCi of 3 H-ATP (36 Ci/mmol, Amersham) in 50 ul of 50 mM 
HCI (pH 7.5). 10 mM MgCfe. 2.5 mM MnCfe, 0.3 M NaCI, 1.5 mM 2-mercaptoethanol and containing 2.5 ug 

so of RNAse-free BSA and 5 units of human placental ribonuclease inhibitor (BRL). After phenol/chloroform 
extraction, the RNA was purified by chromatography on Sephadex G50 and precipitated with 2.5 volumes of 
ethanol. The polyA RNA was used to prepare probes and as a template for the cDNA library. 

To make probes 1 ug of the polyA RNA was incubated for 10 min at room temperature in 5 ul of 10 
mM methylmercury hydroxide and then 45 min at 37 *C with 40 units of reverse transcriptase in 100 ml of 

55 50 mM Tris HCI (pH 8.3). 10 mM MgCI 2 . 1.5 mM 2-mercaptoethanol, 1 mM dATP, dGTP and dTTP, 10 uM 
dCTP, 0.2 mg/ml of actinomycin D. 5 units of human placental ribonuclease inhibitor. 500 uCi of alpha ^P- 
dCTP (3000 Ci/mmole. Amersham) and 20 ug of oligonucleotides obtained by partial digestion with DNAse 
I of calf thymus DNA (random primers). After 15 and 30 min. ten more units of reverse transcriptase were 
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added. After phenol/chloroform extration and Sephadex G50 column chromatography the RNA was 
hydrolyzed with 0.1 M NaOH (1 hr at 65 *C) thus yielding single stranded cDNA strands. The solution was 
neutralized with 0.1 M acetic acid and added directly to the hybridization buffer. 

For the cDNA library two separate cloning protocols involving dT (12-18) primers or random (calf 
5 thymus), DNA-derived oligonucleotide primers were used. RNA polyadenylated in vitro as described above 
was used. Approximately 1 ug polyadenylated RNA was incubated with 10 mM methylmercury hydroxide in 
a 10 ul volume for 10 min at room temperature, and excess reagent was titrated by adding 1 ul of a 3M 2- 
mercaptoethanol solution. This denatured polyA RNA was used immediately in the presence of 50 mM Tris 
pH 8.0. 1 mM dATP. dGTP, dCTP and dTTP t 2.5 ug/ml dT12-18 or the calf thymus random oligonucleotide 
10 primers, 10 mM MgCfe, 10 ug/ml actinomycin D, 100 units of RNAse inhibitor (BRL) and 60 units of reverse 
transcriptase in a total volume of 100 ul. 

The samples were diluted to 400 ul with a buffer containing 10 mM Tris pH 7.0, 100 mM NaCI, 10 mM 
EDTA and 0.2% SDS extracted with phenol/chloroform, freed of dNTPs by Sephadex G50 chromatography, 
and ethanol precipitated. 

is The precipitated mixture of RNA and cDNA hybrids (10 ul) were diluted into 50 ml of S1 buffer (500 
mM NaCI. 50 mM Na acetate pH 4.5 and 1 mM ZnCfe and digested for 15 min at room temperature with 20 
units of S1 nuclease. The reaction was stopped by diluting the sample to 500 ml with a buffer containing 50 
mM NaCI, 10 mM EDTA and 50 mM Tris pH 7.0, and digestion was continued for 15 min at room 
temperature by adding 20 ug/ml of RNAse A. After phenol and chloroform extraction, the RNArcDNA 

20 hybrids were concentrated by ethanol precipitation and fractionated on a Seph arose CL4B column prepared 
in a 1 ml plastic pipette. The excluded peak, containing molecules larger than 800 base-pairs, was pooled 
and ethanol precipitated to give 50 ng of hybrid for the dT primed, and 200 ng of hybrid for the random calf 
thymus fragment primed reactions. 

Both samples were tailed for dC residues under conditions yielding 15-25 residues per DNA or RNA 

25 termini, and annealed to a dG tailed pBR322 vector linearized at the Pstl site (NEN) at a vector 
concentration of 0.1 ug/ml. The annealed plasmids were transformed into E. coli HB101 to Amp R to obtain 
the cDNA library. 

D.2. Screening of the cDNA Library 

30 

Screening employed a +/- method using labeled cDNAs prepared from RNA isolated from uninfected 
BEKI cells (-probe) and from RNA isolated from the virus obtained after complete lysis of the cells ( + 
probe). Colonies of the E. coli harbored cDNA library were grown, lysed on nitrocellulose filters (two 
replicas) and probed. The hybridization buffer used for + probe also contained an excess of cellular RNA 

35 isolated from uninfected BEKI cells (10 mg/mi). The colonies which gave a clear signal with the + probe 
and no response with the - probe were selected. By this method, 95 oligo dT-primed and 185 random 
primer primed clones were selected. The length of the inserts after Pstl digestion varied from 400 to 4,000 
base pairs. No full-length virus specific cDNA was obtained. 

One of the clones, pDT28. with a 880 bp insert was selected for further analysis. This fragment from a 

40 Pstl digest of plasmid DNA was purified by acrylamide gel electrophoresis, digested with Ddel and Mbol 
and then labeled with the Klenow fragment of DNA polymerase I and the four ^P dNTPs to yield 10* -10* 
cpm/mg of insert. Labelled insert was verified by hybridization to viral RNA fractionated on a 0.9% agarose 
gel electrophoresis in presence of formaldehyde (Smiley, et al, Anal Biochem (1983) 131:365-372). 
Stringent hybridization conditions were used: prehybridizations and hybridizations were overnight at 42 * C, 

45 and 50% formamide was used in hybridizations. Washing was at 65* C first with 2xSSC, 0.1% SDS and 
then with 0.2xSSC and 0.1% SDS. 

In the foregoing verification, RNA from uninfected cells was used as negative control. The absence of 
exogenous viral sequences in the genome of the cells was verified by failure of cellular DNA digested with 
BamHI and EcoRI to bind to pDT28 probe in Southern blot analysis. The RNA from infected cells after 24 

so hrs of infection at a multiplicity greater than 1, and from the pellet of virus after complete cell lysis were 
used as positives. No hybridization was detected with the RNA from the uninfected cells, but the inserts 
hybridized to an approximately 13 kb band of the RNA isolated from the infected cells or from the pellet of 
virus. 

The plasmid pDT28, which had been verified to contain a Pstl insert which binds to the viral RNA, was 
55 used to probe the cDNA library for additional clones, and the entire sequence was recovered by "walking" 
techniques. In this way, eight additional plasmids were recovered which span the entire 12.5 kb genome of 
the virus. The positions of the overlapping inserts are shown in Figure 1 . As shown in Figure 1 , the pDT28 
clone occupies a roughly central portion of the genome. The 8 additional plasmids recovered from the 
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cDNA library in a manner analogous to that described above, but using the appropriate overlapping 
sequence-containing clone as probe, were grown in E. coli , and the plasmid DNA isolated. The inserts were 
sequenced, and verified to contain overlapping portions. The results of this sequencing are shown in Figure 
2. which provides the entire genomic RNA sequence ascertained from the inserts. 

5 The orientation shown in Figure 2 was determined by subcloning pDT28 into M13 into both orientations, 
labeling the resultant phage, and using the labeled phage as a probe against RNA known to be of positive 
polarity. This was done by spot hybridization on nitrocellulose filters using uninfected cell RNA, infected cell 
RNA, and template viral RNA. The infected cell RNA and template RNA should be of positive polarity. 
Therefore, the M13 orientation hybridizing to infected cell RNA and viral RNA contains a negative sense 

io strand, and from this information, the 5' to 3' sequence of inserts from pCT63 to pCT185 could be deduced. 
This conclusion was confirmed by analysis of the sequence of pCT63. which indicates its capability to 
form the expected hairpin structure at the 5' end. and by the absence of additional clones in the cDNA 
library having additional 5' sequences to that of pCT63. 

;5 D.3. Expression of Sequences Encoding flGal-BDV Fusions in E. coli 

Twelve portions of the BDV genome were obtained as follows: (1) the entire cDNA sequences per se. 
(2) products of restriction cleavage (with Pstl or BamHI or both) of the foregoing cDNAs, and (3) a ligated 
sequence obtained by ligating the pCT185 cDNA with a fragment of another. (See the table below.) These 

20 portions were used to encode the BDV portions of the fusion proteins. These eleven BDV protein encoding 
sequences were cloned into one of or a mixture of pUR290. pUR291. and pUR292, which contain restriction 
sites, e.g., BamHI and Pstl sites in all three possible reading frames with the 0-gal codons. so as to encode 
fusion proteins at the C-terminal portion of the 0-galactosidase protein (Ruther, U.. et.al. Em bo J (1980) 
2:1791-1794). Since all three possible reading frames are provided for the restriction sites used, the correct 

25 Feading frame in at least one of the vectors for the fusion protein is assured. Table 1 summarizes the 
vectors prepared and the BDV sequence contained in each. Nucleotide numbers are as indicated in Figure 
2. 



Table 1 

30 





Name 


pUR Parent 


BDV Insert Derived from 


BDV Nucleotides Contained in pUBVD 
Vectors (Numbers as in Fig. 2) 




pUBVDI 


pUR290 


pCT63 


1397-2607 


35 


pUBVD2 


pool 


pCT36 


2037-2574 




pUBVD4 


pUR292 


pCT183 


2955-4560 




pUBVDS 


pool 


pDT28 


5650-6450 




pUBVD6 


pUR290 


pCT174 


7225-10718 




pUBVD7 


pool 


pCT174 


-9500-10811 


40 


pUBVD8 


pUR292 


pDT65 


10442-10811 




pUBVD9 


pUR292 


pDT65 + PCT185 


10442-12470 




pUBVDIO 


pUR290 


pCT185 


11030-12457 




pUBVD11 


pUR290 


pCT185 


11405-12457 




pUBVD12 


pUR291 


pCT63 


597-1397 


46 


pUBVD13 


pUR290 


pDT28 + pDT17 


-6000—7800 



Each of the twelve cDNA sequences was mixed with T4 ligase in the presence of Pstl-digested 
mixtures of pUR290. 291. and 292 (or of one of these if the correct reading frame was deduced) and the 
ligation mixture transformed into E. coli strain D1210 (Lacl* mutant of HB101) to Amp". Successful 
transformants were confirmed by hybridization with labeled insert, and isolated plasmid DNA was analyzed 
by restriction analysis to confirm correct orientation. Expression was induced in successful transformants 
containing correctly oriented inserts by treating with IPTG (1 mM) on l-broth medium containing 40 ug/ml 
ampicillin. Three hours after induction, the cells were harvested, and lysed by sonication. The fusion 
proteins were produced as inclusion bodies, and the inclusion bodies were harvested by the method of 
Klempnauer, et a!. Cell (1983) 33:345-355, and stored at -20 *C suspended in 10 mM Tris (pH 8.0), 1 mM 
EDTA. Approximatelyl0-30 mg inclusion body proteins were obtained per ml of culture. 
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D.4. Characterization of the Fusion Proteins 

The fusion proteins were characterized as to their antigenic properties both in insoluble and solubilized 
forms. 

5 Inclusion body proteins solubilized in 1% SDS or 7 M urea followed by dialysis to a final concentration 
of 1 mg/ml are unreactive with sera from infected calves or from rabbits infected with purified virus. 

Preparation of Antisera. Both solubilized and unsolubilized inclusion bodies were injected into rabbits 
using peri-lymph nodaJ immunizations with 500 ug protein emulsified with Freund's complete adjuvant, with 
boosting every 4 weeks (IM injection of 500 ug emulsified in adjuvant) and bled 10 days after boost. Control 

io antisera were prepared from infected calves or from rabbits injected with purified virus. The antisera were 
tested for immunoactivity by ELISA and immunofluorescence, and by Western blot and immunoprecipita- 
tion. 

Western blot and immunoprecipitation yield complementary information with respect to reactivity. In 
immunoprecipitation, the native protein mixture is reacted with the test serum and the immunoprecipitate 
1$ subjected to SDS-PAGE. Therefore, immunoprecipitation assesses immunoreactivity with the native protein. 

However, in the Western SDS blot procedure, PAGE is performed before the antisera are tested for 
precipitation with the proteins on the gel. Therefore, Western blot assesses reactivity with denatured protein. 

The results of these procedures are given below. 

Results. The control antisera were immunoreactive with respect to proteins extracted from the virus 

20 pellet produced on BEKI cells, and showed immunoprecipitation with the 76 kD protein presumed to be the 
major antigenic component, as well as minor components presumed to be, at least in part, virion proteins 
having molecular weights of 36, 43, 47, 51 and 56 kO. No immunoprecipitation occurred when the control 
antisera were tested on Western blot Control antisera against infection thus react with antigens in the native 
protein, but not after denaturation. 

25 Immunoprecipitation and Western Blot. Most of the antisera formed in response to the fusion proteins 
were negative both in assay by immunoprecipitation and, like the control antisera, on Western blot. 

However, there were exceptions. The antiserum generated by fusion protein 7 immunoprecipitates the 
36 kD protein from BEKI-grown virus and reacts by Western blot to the 76 kD and 51 kD bands. Antiserum 
from fusion 5 immunoprecipitates 3 sizes of proteins: 64, 98, and 105 kD, sizes not precipitated by control 

30 antisera. Antiserum from fusion 9 precipitates a 58 kD band, also not precipitated by the control antisera. 
The significance of MW of the materials is not clear since it is not clear which, if any, of these proteins 
represent glycosylated materials with corresponding alterations in molecular weight. 

ELISA (carried out according to the procedure of Bartlett, et al, in Prot'des of the Biological Fluids , H. 
Peeters, ed.. Pergamon Press, Oxford, 1976. 24:767-770) used partially purified virus as antigen. Only the 

35 antiserum prepared against fusion protein 7 was positive at a 1:40 titer; serum prepared against fusion 
proteins 5 and 1 1 had titers of 1:4 and 1 :8, respectively. Nonimmune sera were negative. 

Immunofluorescence was conducted using labeled live or fixed infected cells. The antiserum prepared 
against fusion protein 11 was slightly positive in immunoreactivity with live cells; on cells fixed with 
methanol, acetone, or formaldehyde, serum prepared from fusion protein 7 gave the same strong response 

40 as control antisera from the infected animals, whereas antisera 5 and 3 were weakly positive against 
proteins extracted from the virus pellet produced on BEKI cells. 

Claims 

46 1. A nucleotide sequence encoding a bovine diarrhea related polypeptide which is a region of the bovine 
diarrhea virus genomic sequence shown in Figure 2. or a combination of regions of that sequence. 

2. A nucleotide sequence encoding a viral polypeptide substantially identical with that encoded by the 
bovine diarrhea virus genomic sequence shown in Figure 2. 

50 

3. A recombinant expression system capable, in a compatible host cell, of effecting the production of a 
bovine diarrhea virus related polypeptide which system comprises a DNA sequence of claim 1 being 
operably linked to a control sequence compatible with said host. 

55 4. The system of claim 3 which further includes upstream of said DNA sequence, and in reading frame 
therewith, a fused nucleotide sequence encoding a host protein or portion thereof. 
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5. The system of claim 4 wherein the fusion DNA sequence encodes an N-terminal portion of $• 
galactosidase. 

6. A recombinant vector which comprises the expression system of claim 3. 

5 

7. Recombinant host cells transformed with the vector of claim 6 or with a vector comprising the system 
of claim 4 or 5. 

& Polypeptide substantially identical with the entirety of the amino acid sequence as represented in 
to Figure 2, or with a region or combination of regions thereof. 

9. Polypeptide according to claim 8, which is further fused to a host protein or portion thereof. 

10. A vaccine effective against bovine diarrhea virus which comprises the polypeptide of claim 8, and 
75 pharmaceutical^ acceptable excipients. 

11. A vaccine according to claim 10. further containing an immunogenic particle, which particle comprises 
a polypeptide having an amino acid sequence capable of forming a particle when said sequence is 
produced in a eucaryotic host. 

20 

12. The vaccine of claim 11 wherein the particle forming amino acid sequence is derived from hepatitis B 
virus. 

1a The vaccine of claim 12 wherein the particle forming amino acid sequence is derived from HBsAg. 

25 

14. A method for preparing a polypeptide according to claim 8, which comprises culturing the cells of claim 
7 and recovering the recombinant polypeptide. 

15. A method for preparing an anti-bovine diarrhea virus vaccine which comprises the method of claim 14, 
30 and further adding pharmaceutical^ acceptable excipients. 

16. Use of a polypeptide according to any of claims 8 and 9 for the preparation of tests for the 
immunological detection of fusion proteins of the bovine diarrhea virus. 

35 17. Use of a nucleotide sequence according to claim 1 for the construction of oligomeric sequences useful 
as diagnostic probes. 

18. Use of a nucleotide sequence according to claim 1 for the preparation of a vaccine comprising as 
carrier a live viral vector permitting expression of a desired bovine diarrhea virus antigen along with a 
40 carrier's protein in infected cells. 

PatentansprUche 

1. Nukleotidsequenz, die den Code eines mit der Rinderdiarrhoe zusammenhSngenden Polypeptids 
45 enthait, und die ein Bereich der in der Figur 2 wiedergegebenen Genomsequenz des Rinderdiarrhoevi- 

rus, oder eine Kombination von Bereichen dieser Genomsequenz ist 

2. Nukleotidsequenz, die den Code eines Viruspolypeptids enthSIt, das im wesentlichen mit dem Viruspo- 
lypeptid identisch ist, dessen Code der in der Figur 2 wiedergegebenen Genomsequenz des Rinder- 

50 diarrhoevirus entspricht. 

a Rekombinierendes Expressionssystem, das in der Lage ist, in einer kompatiblen Wirtszelle die 
Produktion eines mit dem Rinderdiarrhoevirus zusammenhSngenden Polypeptids durchzufUhren, wobei 
dieses System eine DNS-Sequenz gemSfl Anspruch 1 umfaflt, die mit einer mit diesem Wirt kompati- 
55 blen Steuersequenz funktionsfahig verbunden ist. 

4. System gem8fl Anspruch 3, das auflerdem vor dieser DNS-Sequenz, und in dem zugehfirigen 
Leserahmen, eine verschmolzene Nukleotidsequenz enthSIt, die den Code eines Wirtsproteins oder 
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eines Abschnitts davon enthalt. 

5. System gemafl Anspruch 4, bei dem die Sequenz der Verschmelzungs-DNS den Code eines N- 
Endabschnitts von 0-Galaktosidase enthalt. 

6. Rekombinationsvektor, der das Darstellungssystem von Anspruch 3 enthalt. 

7. Rekombinationswirtszellen. die mit dem Vektor von Anspruch 6. oder mit einem Vektor. der das System 
von Anspruch 4 oder 5 enthalt. umgewandelt wurden. 

a Polypeptid, das im wesentlichen mit der Gesamtheit der Aminos2uresequenz, wie in der Figur 2 
dargestellt, oder mit einem Bereich oder einer Kombination von Bereichen davon identisch ist. 

9. Polypeptid gemSA Anspruch 8, das auBerdem mit einem Wirtsprotein oder einem Abschnitt davon 
verschmolzen ist. 

10. Impfstoff, der gegen den Rinderdiarrhoevirus wirksam ist. und der das Polypeptid des Anspruchs 8 und 
pharmazeutisch akzeptable Arzneitrager enthalt. 

11. Impfstoff gemafl Anspruch 10, der aufierdem ein ImmunitSt erzeugendes Partikel enthalt, wobei dieses 
Partikel ein Polypeptid umfaBt das eine Aminosluresequenz aufweist. die in der Lage ist, ein Partikel 
zu bilden, wenn diese Sequenz in einem eucaryotischen Wirt produziert wird. 

12. Impfstoff gemSfi Anspruch 11, bet dem die Aminosluresequenz, die das Partikel bildet, von einem 
Hepatitits B-Vlrus abgelettet ist 

13. Impfstoff gemSB Anspruch 12, bei dem die AminosMuresequenz, die das Partikel bildet, von HBsAg 
abgeleitet ist. 

14. Methode zum Herstellen eines Polypeptids gemMfl Anspruch 8, bei der die Zellen von Anspruch 7 
gezQchtet warden, und das Rekombinationspolypeptid zuriJckgewonnen wird. 

15. Methode zum Herstellen eines Impfstoffs gegen den Rinderdiarrhoevirus, die die Methode von An- 
spruch 14 umfaflt. und bei der auBerdem pharmazeutisch akzeptable Arzneitrager hinzugegeben 
werden. 

16. Verwendung eines Polypeptids gemSB irgendeinem der Ansprilche 8 oder 9 zur Entwicklung von Tests 
zum immunologischen Nachweis von Verschmelzungsproteinen des Rinderdiarrhoevirus. 

17. Verwendung einer Nukleotidsequenz gemafl Anspruch 1 zum Aufbau von oligomeren Sequenzen, die 
als Diagnosesonden nOtzlich sind. 

1fc Verwendung einer Nukleotidsequenz gemSfl Anspruch 1 zur Herstellung eines Impfstoffs, der als 
TrSger einen lebenden Virusvektor aufweist, der in Verbindung mit dem Protein eines TrSgers die 
Darstellung eines gewUnschten Rinderdiarrhoevirus-Antigens in inftzierten Zellen ermdglicht. 

Revendlcatlons 

1. Sequence de nucleotides codant pour un polypeptide apparent^ a la diarrhee bovine qui est une region 
de la sequence genomique du virus de la diarrhee bovine representee dans la figure 2 ou une 
combinaison de regions de cette sequence. 

2. Sequence de nucleotides codant pour un polypetide viral sensiblement identique a celui code* par la 
sequence genomique du virus de la diarrhee bovine representee dans la figure 2. 

3. Systems d'expression recombinant susceptible d'effectuer, dans une cellule-hdte compatible, la pro- 
duction d'un polypeptide apparent^ au virus de la diarrhee bovine, ce systeme comprenant une 
sequence d'ADN selon la revendication 1 qui est fonctionnellement liee a une sequence de contrdle 
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compatible avec I'hote pr§cite\ 

4. Systeme selon la revendication 3 qui comprend par ailleurs, en amont de la sequence d'ADN precise 
et en phase de lecture avec elle, une sequence de nucleotides fusion nee codant pour une prot£ine- 

5 note ou une partie de celle-ci. 

5. Sy steme selon la revendication 4, dans lequel ta sequence d'ADN de la fusion code pour une partie N- 
terminaJe de beta-galactosidase. 

io 6. Vecteur recombinant qui comprend le systeme d'expression de ta revendication 3. 

7. Cellules-hdtes recombinantes transform^es par le vecteur de ta revendication 6 ou par un vecteur 
comprenant le systeme de la revendication 4 ou 5. 

15 8. Polypetide sensiblement identique a la totality de la sequence d'acides amines telle que representee 
dans la figure 2 ou a une region ou une combinaison de regions de celle-o. 

9. Polypeptide selon la revendication 8, qui est par ailleurs fusionne* avec une proteme-hdte ou une partie 
de celle-o. 

20 

10. Vaccin efficace contre le virus de la diarrhee bovine qui comprend le polypeptide de la revendication 8 
et des excipients pharmaceutiquement acceptables. 

11. Vaccin selon la revendication 10, contenant par ailleurs une parti cuie immunogenique qui comprend un 
25 polypeptide ayant une sequence d'acides amines susceptible de former une particule lorsque la 

sequence precise est produite dans un hdte eucaryotique. 

12. Vaccin selon la revendication 11, dans lequel la sequence d'acides amines formant la particule est 
de*riv6e du virus de I'hepatite B. 

30 

13. Vaccin selon la revendication 12, dans lequel la sequence d'acides amines formant la particule est 
d^rive'e du HBsAg. 

14. Precede* de preparation d'un polypeptide seton la revendication 8, qui consiste a cuttiver les cellules de 
35 la revendication 7 et a recuperet le polypeptide recombinant. 

15. Precede de preparation d'un vaccin contre le virus de la diarrhee bovine qui comprend le precede de 
la revendication 14 en ajoutant par ailleurs des excipients pharmaceutiquement acceptables. 

40 16. Emploi d'un polypeptide selon I'une quelconque des revendications 8 et 9 pour ta preparation de tests 
permettant la detection immunologique de proteines de fusion du virus de la diarrhee bovine. 

17. Emploi d'une sequence de nucleotides selon la revendication 1 pour la construction de sequences 
oligomeres susceptibles d'§tre utilisees comme sondes de diagnostic. 



45 



18. Emploi d'une sequence de nucleotides selon la revendication 1 pour la preparation d'un vaccin 
comprenant comme vehicule un vecteur viral vivant permettant I'expression d'un antigene souhaite du 
virus de la diarrhee bovine conjointement avec une proteine du vehicule dans les cellules infectees. 



50 
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riCOftE 1 




A: 



B: 



Mftpplnt of 9 EDV cDKA eloees which spin the vhole |«now. Clone* -ere 
SriveS fro. oli.o dT pri-ed cDKA (PT clones) or fro* rse4o.ly primed 
cDHA usint celf thymo. oli$onucleotides (CT clones). Kimes of 
S; «K 2ipCt36; 3-pCr**; S-pTOS; 6-pW17; 

7-pC?174; -a-pHM; 9-pCX^83*j V *fCr4>* 

cDHA fragments used to construct expression vectors for E.coll, by 
fusion to the E.coli B-i«l**t MtdM « *•«•• 



17 



EP 0 208 672 B1 



FIGURE 2 
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Flgura 2 



I I 

ncol 



1 ATGT ATACGAGAATT T6CCT AACCTC6TATACATATT666CATTCTAAAAATAAATTA66CC 
CATATGCTCTTAAACGGATT6GAGCATATGTATAACCCGTAA6ATTTTTATTTAATCCGG 

SB *tut , 61 mstll, 

63 T AAGGGACAAATCCTCCT TAGCGAA6GCCGAAAAGA66CTA6CCATGCCCTTAGTA6GAC 
ATT CCCTGTTTAGGAGG AATCGCTTCC6GCTT TTCT CCGATCGGTAC6GGAAT CATC CTG 

100 nhal v 

123 T AGCAAAACAAGGAGGGT AGCAACAGTG6TGA6TTCGTT6GATG6CT6AAGCCCTG 

ATCGTTTTGTTCCTCCCAT CGTTGTCACCACTCAAGCAACCTACCGACTTC666ACTCAT 

183 CAGGGT AGTCGTCAGTGGTT CGACGCTTC6TGT6ACAAGCCTCGAG6TG^ 

GTCCCATCAGCAGTCACCAAGCT6C6AA6CACACTGTT CGGA6CTCCACGGT6CACCT6C 

223 Khol f 

243 AGGGCATGCCCACAGCACATCTTAACCTGA6CGGGGGTC6TTCAG6T6AAAG 

TCCCGTACGGGTGTCGTGTAGAATTGGACTC6CCCCCAGCAAGTCCACTTTCGCCAAATT 

246 iphl, 

303 CCAACCGCTACGAATACA6CCTGATAGGGTGCTGCAGAG6CCCACT6T^ 

GGTTGGC6ATGCTTATGTCGGACTATCCCAC6ACGTCTCCGGGTGACATAACGATGATTT 

334 pstl, 

MetGluLauI l«ThrA»nGluL»uL»uTvrLv»ThrTvr 
363 AATCTCTGCTGTACATGGCACAT6GAGTTGATTACAAAT6AACTTTTATACAAAACATAC 
TTAGA6ACGACAT6T ACCGTGTACC^ 

LysGlnLY«ProAlaGlYV«lGluGluProValTvrA*n61nAl«GlyAjp^ 
423 AAACAAAAACCCGCT6GAGTG6AGGAACCAGTATATAACCAA6CA66TGACCCTTTGTTT 
TTTGTTTTTGGGCGACCTCACCTCCTTGGTCATATATT6GTTCGTCCACT6G6AAACAAA 



483 



466 bst£2, 

GlyGluArgGlYValValHisProGlnAlaThrLguLvmLjuPpoHijL^ 
G6tGAGAGAG6AGTGGTTCATCCGCAGGC6ACGCTAAAACTGCCACATAAAA6A^ 
CCGCTCTCTCCTCACCAAGTAGGCGTCCGCTGCGATTTTGACGGTGTATTTTCTCCCCTC 

ArqGluValProThrA«nLauAlaSarLauProLY«Ar961yAapCy»ArgSerGly 

543 CGCGAAGTACCTACTAATCTGGCGTCTCTGCCAAAA^G^ 

6CGCTTCATGGATGATTAGACCGCAGA6ACG6TTTTTCTCCACT6ACGTCCAGCCCATTG 



603 



587 pstl, 

SerLy5GlyProValSerGiYll«TyrLauLymPpo6lYProL;uPh;TYrGlnA«p^ 
AGCAAGGGACCCGTGAGTGGAATCTACCTGAAACCG6G^ 

TCGTTCCCT6GGCACTCACCTTAGATG6ACTTTG6CCCC6GCAATAAGATGGTCCTAAT6 
Ly«6iyProValTyrHi5AraAlaProLau61uPh«PhaGlnGluAlaS;rM;tCy»Glu 

663 aaaggAcccgtctXtcatagXgctccattggagt.t^ 

tttcct6ggcagatagtatctcgag6taacctcaagaaagtccttcggagatacacactc 

6*2 tacl t 

ThrThrArqArqlle6lYAraValThrGlyS«rAmp61yLYat;uTvrHl^ 
ACAACTAGAAGbATT6GGA6AGTAACT66TAGT6AT6GTAAATTGTA^ 
TGTTGATCTTCCTAACCCTCTCATTGACCATCACTACCATTTAACAT6GT6TAAATACAC 

CY«IlaA*pGlYCy«il«ll«ValLY»SarAlaThrLY«TvrHi«^ 
T6CATAGATGGAT6CATAATAGTTAAGAGC6CCACAAAATATCATC^ 
ACGTATCTACCTACGTATTATCAATTCTCGCGGTGTTTTATAGTA6TTTTCCAT6A6TTT 

794 ava3, 



723 



783 



643 
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GiuGlyA! jThpAroLvsLvsGinGl nLy*ProA«pAr gLeu6?uLyf G I yAroNfltLv* 
903 GAA&GdGCGACAAGiAAGAi§CAACAAAAACCAGAtAGGCTG6 

: CTTCCCCGCTGTTCTTTCTTC6TTGTTTTTGGTCTATCCGACCTTTTCCCCTCCTAC7TC 

1 1 •ThrProL yiGluSwrGI uLy*A«pS«rLysThrLv«P roProAaoA taTlirt uoil 

963 TATTrlir It T TCTcIerr tttttSy^ 

TATTGAGGATTTCTCAGCCTCTTTCTATCATTCTGGTTTG6C6GTCTAC6ATGCTATCAC 



VtlAspG 
1023 GTAGATGG 
CATCTACCA 



t 




mmmmimmmssimm 

•«» mmmmmsmmmmm 

1140 bgM, 

CC6CATTTATCTTCA6AC6TACCCTA6ACC6CTCTCTTTTA6ACAT6TCCAC*«fi66A«6 
1290 b«ll, 

1323 

1327 ball, 1333 tthim, 1371 nhal, 

T6TTTAAT6TSTAC6ACG7CC6AS6TT8CWTACTCACCTTSTTMTAC 
1397 patl, 

ACCAT6TTATAACTT6GAACCTAACAABA6TACTTATTTTS6«TTCS^ 
14*9 aapl, 

CCAGTCGGTGGTTCCCTCACACGGCAATGTACGeCCATAra 

1563 ■ftffUgg^^ 

CATCATTGT6TTC6ATCCTT6TCG66GT6T6STAACT6TCC6ACSTTCTTTCC6TTCTT€ 

AA6A6GAAAC6TCCACACAACCAT6TTCCCG6AAC0TT6AAa^TTATCGACATTCACTA 

1663 STCCTGTTTAGXGAGCACGAfTlcACAnTGTSATTC^ 

CACGACAAATCTCTCGT6CTAAC6TGTTCACACTAAGTTCC61GCCGACTGATAGACCAT 

■ CT6CCCTACT6GTTAAGA6ATCTTTCACGGTCC6TTCCCTGGCICT 
1760 xbal, 1790 hp*1, 

kni t?c?£¥ A £?S i 2^*«»-y;Ly alauG 1 yLvaLy aLauGl uAanLyafiarLyaThrTrpPha 

03 IISSSItSsSASIIIWSiiACTAGGG^^ 

AACCCATCCGTC6AATTCTTTGATCCCTTCTTTGACCTTTT6TTCTCATTCT6TACCAAA 

1»15 aftll, '• . 

CCCC6TATAC6TCGGA6AG6GAT6AC6CTCCATCTTGCCTCC^ 
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ThrLysAinCy*ThrProA|*Cv»L»uProlyiA*nThrLysll»ValGl yProG lyAro 
19 23 ACAAAGAATTGCACCCCTGCCTGTTTACCAAAAAAT ACAAAGAT CGT TGGCCC CGGT AGG 
TGTTTCT TAACGTGGGGACGGACAAAT6GTTTTTT ATGTTTCTAGCAACCGGGGCCATCC 

PhvAvpThrAsnA 1 aG luA«pGtyLyt I !«L»uHimGl uhetG 1 yGl yHltLftuStrG lu 
1963 TTCGACACCAATGCGGAGGATGGTAAAATACTGCATGAGATGGGGGGCCACTTGTCAGAG 
AAGCTGTGGT T ACGC C T C C T AC CAT T T T ATGA CGT AC TC T AC C C CCCGGT GAAC AGT C T C 

VaIL*uL»uL»uSerVal Va 1 V« ILvuStx At pPhcA 1 aProG luThrAl «S«rVa 1 Va I 
2043 GTGCTACTACTCT C AGTGGT AGTGCTT TCCGATT T CGCT CCAGAGACAGCCAGTGTGGT A 
C ACGATGAT6AGAGT CACCAT CACGAAAGGC T AAAGCGAGGT CT CTGT CGGT CACACCAT 

TyrLvui 1 cLauH i s PheSar I 1 «ProG I nG I yH I sThrAtp I 1 «H1 «A*pCy »A« pLy ft 

2t03 tAtttaattctacatttctccatcccacaaggacacactgaCatacatgactStgaYaaa 
ataaattaagatgtaaagaggtagggtgttcctgtgtgactgtatgtactgacactattt 



2163 



2223 



2343 



AsnGI nLau As nLvuThrVa 1 G 1 yLeuThrThrA 1 aGluVal 1 lcProGlyScrVa 1 Trp 

aaccaactaaacctcaccgtaggactcacaacagctgaagtaatacctgggtcagtttgg 
tt ggttgatttggagtggcat cctgagtgt tgt cgactt catt atggacccagt caaacc 



2194 pvull, 2209 tthllll 



A»nL»uGI vLysTyrValCvftl 1 •ArgProAtpTrpTrpProTyrG luThrAI aThrPhs 

aatttgggcaaatatgttt6tataagaccagattg6tggccttatgagacagccacgttc 
ttaaac ccgttt at ac aaacat attctggtct aacc acc6gaat actct6tc6gt6caag 



22S0 bstKl 



L»uV*lPh«GluGluVi 161 v61nV*l Il»AraII«V«lL»uArqAlaL«uAraAftpL*u 
22*3 CT AGTGTT TGAAGAGGTGGGTCAAGTGAT CAGGAT AGT CTTGA6&6CTTT AAGaGATCT A 
G ATCACAAACTTCT CCACCCAGTTCACTAGTCCTATCA6AACTCCC6AAATT CT CTAGAT 



2306 belt, 2336 bglll, 

ThrArql 1 *TrpThrAl aA I aThrThrThrAlaPhsL«uVal CyftLauValLyftVa 1 Va 1 

acgcgCatttggaccgctgctacgactactgcattcctggtat6tctggtgaaggtggtg 
tgcgcgt aaacctggcgacgatgctgatgacgt aaggaccat aca6accacttccaccac 



ArgGlyGlnValLvuGlnGlyllcLtuTrpLaul IcLcul IvThrG I yAl aGInG lyLvu 

2403 agaggccaagtgttgcaaggcatactgtggttgatactcataacaggggcacaaggGctc 
t ct ccggtt cac aacgttccgtatgacaccaact atgagt at tgt cccc6tgttcccgag 

ProVal CysLysProGlyPhvTyrTyrAlal lcAlaCysAtnAtnGluI l«GlyProL«u 
2463 CCAGTTTGCAAACCCGGcT T T T ACTAC6CCAT AGCCAAAAAT AATGAGATC66CCCTCT T 
GGTCAAAC6TTT6GGCCGAAAAT6ATGCGGT ATCGGTTTTTATT ACT CTA6CC666A6AA 

G lyAl *ThrG t yLvuThr ThrGI nTrpTyrGluT yrScrAspGlyH«tArgL«uGlnAftp 
2523 GGGGCT AC6GGCCT CACCACT CAGT6UTATGAAT ACTCGGATGGGAT6C6bCT6CA6GAC 
CCCCGATGCCCGGAGTGGTGAGTCACCAT ACTT ATGAGCCTACCCTACGCCGACGTCCTG 

2S74 p»tt, 

ThrGlyV* 1 Valval TrpCysLymGlyGlyGlui 1 aLy *TyrL»u 11 #ThrCy«6 1 uArg 
2S63 ACGGG AGT TGT AGTGTGGT GT AAAGGTGGAGAGAT CAAAT AT CT AATTACATGTGAGAGb 
T6CC CT CAAC AT CACAC C ACATT TCCACCTCT CT AGTTTAT A6ATT AATGTACACTCTCC 

GluAl AArgTyrLtuA t a I I «L«uHi»ThrArgAl aLftuProThrSvrVal Va I PhcG 1 u 

2643 GAAGCCAGbT ATCTGGC CATTCTACACACGAGAGCCCT6CCGACGTCT6TAGTATTTGAA 
CTT CGGT C CAT AGAC CGGT AAGATGTGTGCT CT CGGGAC6GCTGCAGACATCATAAACTT 



2647 bttKl, 26S6 ball, 2664 aatll. 



Lysl !•! l«A»pGlyLy»Glu6)n61uAftpVAlVAl6luH»tA»pA»pA»nPh«61uL«u 
2703 AAA AT CAT AGATGGAAAAGAACAAGAGGACGTA6TGGAAATGGAT6A I AACTTTGAACT C 
TT TT AGTATCT ACCTTTTCTT6TTCTCCT6CATCACCTTT ACCTACT ATT6AAACTT6A6 

Giyl»uCy«ProCysA«pAlaLysPraL«uVal ArgGlyLy«Ph»AsnThrThrL«uL«u 
2763 GG T CT TTGCCCGTGTGATGCT AAACCCTTGGT AA6G66AAAAT7T AATACAACACTT CT6 
CCAGAAACGGGCACACT ACGATTT6G6AACCAT T CCCCT TTT AAATTATGTTGTGAAGAC 



2623 



AsnGI yProA I aPh»G t ntletVa I CyvProI 1*6 1 yTr pThrGl yThrVal 8«rL*uCys 
A ATGGuC CAGCCT T CCAGATG6TTTGCCCT AT AGGAT GbACAGGAACTGTGAGTCTGTGT 
TT ACCCGGT CGGAAGGT CT ACCAAAC6G6AT ATCCT ACCT6TCCTT6ACACTCAGACACA 

Hl*TrpS«rA»nLy*A»pThrL«uAl*M«tThrV*l VAlAraThrTyrLyftArgHlmAro 
2663 CACTGGTCCAATAAGGATACGTTAGCCATGACCGTTGTAC6AACATACAAGA6GCACA66 
GTGACCAG6TT ATT CCT ATGCAATCG6TACTGGCAACAT6CTTGT AT6TTCTCC6TGTCC 



2940 mtut , 



2943 



ProPh»ProPheArgGlnGlyCy»I l«ThrGlnLymV* t I 1»61 v6lyAftpL«uTyrAftp 
CCTTTCCCCTTTAGgCAAGGtTGCATTACCCAGAAAGTCATCGGeGGXGA|CTCTXCGAt 
GGAAAGGGGAAAT CCGT T CCGACGTAATG66TCTTTCA6T AGCCCCCT CT66AGATGCTG 
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Figure 2 

S003 T GTGCCTTGGGAG6GAAC TGGAC T TGTGT ACCGGGGGACAT ACT ACGAT ATGT AGATGGG 
ACACGGAACCCT C CCTTGACCTGAACACATGGCCCCCTGT ATGATGCTATACATCT ACCC 

PraVa lGluSerCysLysTrpCysGl yT y rLys PheHi sL ymSerG luG lyLauProHit 
$063 CCT GT CGAGT C T T GCAAGT GGToTGGTT ACAAGTT T CAT AAAAGTGAGGGT CTGCCACAC 
GGACAGCT CAGAACGTT CACCACACCAATGTT CAAAGTATTTTCACT CCCAGACGGTGTG 

PheProl 1 eGl yLysCysLysLeuLytAsnGluSerGlyTyrArgGlnVal AspGluThr 
3 123 TTCCCAATTGGCAAGTGCAAGCTGAAGAATGAAAGTGGCTACAGACAAGTA6ATGAGACC 
AAGGGTT AACCGTT CACGTT CGACTTCTTACTTTCACCGATGT CTGTT C AT CTACT CTGG 

SarCysAsnArgAspGlyVai A I a 1 1 eVa I Pr oTKrG I ySerVa 1 LyvCytLy* 1 1 *G 1 y 
3163 TCTTtCAACAGAGACGGTGTGGCTATAGTACCAACTGGTTCGGTGAAATGCAAGATAGGG 
AGAACGTTGT CTCTGCCACACCGAT ATCATGGTTGACCAAGCCACTTTACGTTCTATCCC 

AspThrValVa lGlnVal I leAl aMetAspAspLysLeuGI yProH«tProCy»ArgPro 
3243 GACACAGTGGTGCAAGT CAT AGCAATGGATGA T AAGCTAGGGCCT ATGCCTTGCAGACCA 
CTGTGT C AC CACGTT CAGT AT CGTTACCT ACT ATT CGATCCCGGATACGGAACGTCTGGT 

3301 ndet , 

TyrGluI 1 e 1 l«ProSer6 luGlyProVa 1 Gl uLysThrAl aCysThrPhvAsnTvrThr 
3303 TATGAAATCATTCCCAGTGAG6GGCCG6TAGAAAAGACG6CAT&TACCTTCAACTACACA 
AT ACTTT AGT AAGGGTCACTCCCCGGCCAT CTTTT CTGCC6T ACAT66AA6TT6ATGTGT 



3363 



3306 mnl , 

Ly«ThrLeuLy*A*nLysTyrTyrGluProArgA*pA»nTvrPh«GlnGlnTYrt1;tL«u 

aaaacattaaagaacaagtattatgagcctagSgat aattatttccaacaatacatgtta 
tttt6taatttctt6ttcataatactcggatccctattaataaaggttgttatgtacaat 

3390 avr2, 

Uy»Gly61uTyrGlnTyrTrpPh«A«pL«uGluIl«ThrA»pHi«Hi»AraA*pTvpPhe 
3423 AAAGGGGAGTACCAATATTGCTTTGACCTAGAGATCACTGAtCACCACCGfigAYTACTTC 
TTTCCCCTCAT66TT ATAACCAAACT66ATCTCTA6T6ACT6GTG6TGGCCCTAATGAA6 

3436 sspl, 

Al*GluS«rLeuLeuV«UlBV*lVjlAl»LBuL*u6lyGlYArgTvrV*lL»uTrpL 
3463 GCTGAGTCCCTACTGGTGATAGTGGTTGCACTCCT66GtG6TA6GTACGTGCTCT6GTTA 
CGACTCAGGGATGACCACTATCACCAAC6TGAGGACCCGCCATCCAT6CACGAGACCAAT 

LeuV»iThrTyrHetIleUeuS«rGluGlnMetThrS«rGlyArgProVjlTrpAU 
3543 CTGGTTACATATATGATCCTATCAGAACAAATGACCTCGGGACGf CCAGTATGGGCAG6T 
6ACCAATGTATATACTA66AT AGTCTT6TTT ACTGGAGCCCTGCAGGTCATACCCGTCCA 

3563 aatll, 

Glull»V«lM«tttet61yAsnLeuLeuThrHisAspSepll«GluyjlV;lThrTyrPh« 
3603 GAAATAGTGATGATCGGCAACCTGCTAACACATGACAGTATT6AAGT6GTGACTTATT^ 
CTTTATCACTACTACCCGTTGGACGATTGTGTACTGTCATAACTTCACCACT6AATAAAG 

LBuLeuLeuTyrLeuLeuLBuArgGluGluAsnIl»Ly»Lv*TrpVjill«L;ull«Tvr 

3663 TTACTACTATACCTACTACTAAGAGA6GAAAACATCAXaaXaTG6GT^ 

AATGATGATATG6ATGAT6ATTCTCTCCTTTTGT A6TTTTTTACCCAAT ATGAATATATG 

Hisnen«V*lMetHi«ProL«uLv«S«ryalT^ 
CACATCATAGTAATGCACCCACTAAXATCAGTGACGGTGATACT^ 

GTGTAGTATCATTACGT6G6TGATTTTA6TCACTGCCACTATGAC6ATTACCAACCTCCC 

HetAl*ArgAl*GluProGlyAl*GlnSBrPh«L;uGluGlny«lAjpL^ 

ATGGCAAGZcCAGAACCAGGtGCCCAGAGCTTCC.TAGAGCAGGTGGA^ 
TACCGTTCCCGTCTTGGTCCGCGGGTCTCGAAG6ATCTCGTCCACCT6GACTCAAAAAGT 

3601 narl t 

MetIleThrL»uIleV»lValGlvL«uV»Ul5Al;ArgArgA« 
3643 ATGATCACGCTCATTGTAGTAGGTCTGGTCATTGCCAGSCGCGACCCCAC^ 

TACTAGT6C6A6TAACATCATCCAGACCA6TAACGGTCCGCGCTGGGGTGACACCACGGT 

3644 bell , 3902 sp«l f 

L.uValThrlleValAlaAlaL.uArgValThrGlv^^ 
3903 CT AGTCACAAT AGTTGCAGCACTGAGbGT AACGGGACT'AGGCTTTG6^ 

GATCAGT6TTATCAACGTCGTGACTCCCATTGCC:TGATCCGAAACCCGG6CCTCACCTA 

3946 apal f 



3723 



3763 
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Figure 2 

Vj 1 Al ArtetAl av/a 1 LeuThrLeuThrLeuLeuMet 1 leSerTyrValThrA*pTyrPhe 
7 63 GT AGCTATGGCAGTCCTAACCTTGACCCTACTGATGAT7AGTTATGT6ACAGACTACTTC 
CATCGATACCGTCAGGATTG6AACTGGGATGACTACTAATCAATACACTGTCTGATGAAG 

Arq T yrL ysArqTrpLeuG 1 nCy* M eLeuSerLeu i 1 eA 1 aG 1 y Va I PheLeu 1 1 eApa 
023 AGGTACAAAAGGTGGCTACAATGTATCCTCAGCTTAATAGCCGGGGTTTTCCTTATACGA 
TCCAT6TTT T CCAC CGATGTT AC ATAGGAGT CGAATT AT CGGC CCCAAAA6GAAT ATGCT 

SerLeuLysHisLcuGl yG lul 1 eG 1 uThr Pr oG luL«uThrIleProA*nTrpArqPro 
083 AGCCTTAAACATCTGGGCGAGATTGAGACCCCTGAGCTGACCATACCGAACTGGAG6CCA 
T CGGAATTTGT AGACCCGCTCT AACTCT6GGGACTCGACTGGTATGGCTTGACCT CCGGT 

LvuThrPhe I \ oLcuLeuTyrLeuThrSerA t aTrvpVa 1 Va 1 Thr ArgTrpLy»Va I Asp 
1 A3 CTAACCTTCATACTATTGTACCTGACTTCAGCAACA6TT6TCACACGATGGAAAGTTGAC 
GATTG6AA6TATGATAACATGGACT6AAGTCGTTGTCAACAGTGTGCT ACCTTTCAACTG 

I leAlaGlyl l«L«uLeuG 1 nG 1 y ProG 1 nSerPhrCy *0P 
203 ATAGCTGGCATATTACTGCAAGGGCCCCAATCCTTCTGCTGATTGCCACCTATGGGCT 
T AT CGACCGT AT AATGACGTT C CCGGGGTT AGGAAGACGACTAAC6GTGGATACCCGA 

4224 apil, 

.261 GACTTCCTGACCCTTGT AT TGATCCTGCCCACCCACGAATTAGT CAAGTTGT ACT ACCTG 
CTGAAGG ACTGGGAACAT AACT AGGACG6GTGGGTGCTT AATCA6T TCAACATGAT6GAC 

.321 AAGACCGTCAAGACTGATGTGGAAAAGAGTTGGCTAG6GGGGGTGGACTACAA6ACAATT 
TTCTGGCAGTTCTGACTACACCTTTTCTCAACCGAT CCCCCCCACCTGATGTTCT6TTAA 

MetAspGluSerGlyGluGl yV*lTyrL»uPh»ProS«rLy»Gln 
V381 GGCTCTATTTATGATATGGATGAAAGTGGAGA6GGC6TGTACCTTTTCCCATCCAAACAG 
CCGA6ATAAATACTATACCTACTTTCACCTCTCCCGCACATG6AAAAGG6TAG6TTTGTC 

AsnGlvLysLy*AsnValSerl leLeuL«uProLBulleAraAl«ThrLeuil»S«rCy; 
*441 AATGGtAAGAAAAATGT CAGCATACTCTTGCCCCTCATTA6AGCT ACACTAATAAGCT6T 
TT ACCGTTCTTTTT ACAGTCGTAT6A6AACGGGGAGTAATCTCGATGTGATTATTCGACA 

4497 tthllll, 

1 1 eSer SerLv sTrpG I nMetVa 1 Tyrflet A 1 aTyrLeuThrL»uA*pPh;M;tT vrT vr 
4,501 AT CAGCAGCAAATGGCAGATGGTGTACATGGCTTACCTAACCTTGGACTTTATGTXCT AC 
TAGTCGTCGTTTACCGTCTACCACATGTACCGAATGGATTGGAACCTGAAATACATGAT6 

neHisApgUysVallleGluGlulleSerGlyGlyThpA^ 
4561 ATACACAGAAAGGTTATA6AAGAGATCTCA6GGGGCACCAATGT6ATATCTA66GT6ATA 
TATGTGTCTTTCCAATATCTTCTCTAGAGTCCCCCGTGGTTACACTATAGATCCCACTAT 

4583 bglll, 4605 ecorS, 

AlaAlaLeull«GluLeuAsnTppSerM»tGluGluGluGluSyp^ 

4621 GCAGCACTCAT AGAGCT AAACTGGT £T ATGGAAGAAGAAGAAAGCAAGGGC^ 

CGT CGTGAGTATCTCGATTTGACCAGATACCTTCTT CTTCTTTCGTTCCCGAATTTCTTC 

PhePhel leLeuSepGl yApqLeutysAl aLeul lei l«LvsHi«LysVaI ApgA*nGln 

4681 TTTTTTATACTATCTGGfeAGgTTGA^ 

AAAAAATATGATAGACCCTCCAACTTCCGGGAATATTATTTC6TATTCCAATCCTTGGTC 



4741 



ThrValAlaSwrTppTypGlyGluGluGluValTvrGW^ 
ACCGTAGCAAGCTGGTATGGGGAGGAAGAA6TCT ACGGCAT6CCA^ 
TGGCATCGTTC6ACCATACCCCTCCTTCTTCAGATGCCGTAC6GTTTTCATCACT66TAT 



4778 spht f 



IleAraAlaCY*SerLeuA«nLY;A«nLv|HisCY«Iljli;C^ 
4801 ATAAGGGCTTfeCTCACTAAACAjGAACAjGCATT^ 

TATTCCCGAACGAGTGATTTGTTCTT6TTC6TAACGTATTATACGTGTCATACACTCCGA 



4661 



4921 



4961 



Lv*Lv*TpoLv«G1 vG I vAsnCvsProLy*Cy*Gl YArqHi»GiyLy«Proll«ThpCy» 

aX6aXgtgSaX1gg«g?aact£ccct*Xat6c6^ 

TT CTT CACCTTCCCACC6TTGACGGGATTTACGCCGGCGGTGCCCTTC666TA6TSAACA 
4893 nwa3, 

GlyMetThrLeuAlaAspPheGluGluApgHijTypL^ 

ggSatgact ctagcggattttgaagagagScactacaagagaatttt^ 

CCCT ACTGAGATC6CCT AAAACTT CTCTCCGTGATGTT CTCTTAAAA6TATTCTCTCCCA 

ThrPheGluGlyProPheArgGlnGluHisSjrGlyPheV^ 

ACATTCGAAGGACCCT T C AGfiC AGGAAC AT AGCGGbTT TGT ACAAT ACACC6C^ 

TGTAAGCTTCCTGGGAAGTCCGTCCTTGTATCGCCCAAACATGTTATGTGGCGATCCCCT 
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3* J 



101 



161 



21 



281 



Figure 2 

GTTA^CAAGGACTr^ 

GT T AA C AAGGACT C CT T AAAT666T AT AACC6T T6GT7 TCATTTTT AC6AAT ACCA7 CCG 
5049 mstll, 

5i?r?ArrXr?i5 luI i eGI ¥ As eLeuGluHisL#uGl yTrpI 1 eLeuLy*M»tG I nil e 
TTGGATCCCcIt^S?!^^ 

TTGGATCCCCATCTTTAGCCACTAGATCTTGTGGATCCTACCTAGAATTTTTACGTCTAG 
5103 avr2 t 5124 xbil, 5133 avr2, 5156 bgill, 

TTCGTCAnlrTr^rlr 1 ^ 

AA6CACTTTTGGGACTGGCCGTTCTGGTA6TGGGACCTCCACCTCG6GTCACTGTGGTAG 
51fl6 bstXl, 

CTCTTGCACTTCCG6TTCTAG6TCCTATTCCTTCCGTAAGG6GGACTGGTCGTCTCCGA6 
5251 xmnl , 

Ii5x0f^liSll[ Ly ; GlnLeuGluA «e Gl Y Ar 9 s « rL «uS»rA«pTyrA5nIl«GlnLy« 
A ISIII§££ GG £ AAG £ AG SI GG ^^ 

TAGAAACGGCCGTT CGT CGACCTTCTACC6GCGA6AGAAAGACTAATGTT6TA6GTCTTT 
S287 na«l, 5296 pvull, 

5 LiSSSETiEtf uH i**-«uValL»uArgL«uArg6IyS»rGlyProAl*VaiCv*Lv«Lv« 

CTCAGCTGG6ACGTGGACCAGGAGGCAGACTCCCCATCACCCG6ACGGCACACGTTTTTC 
5343 salt, 

-*n« ii5l5551ayiS GluL r sC r« Hi « v *l A «ni^L«uAspLysLBuThrAlAPh«Ph«6ly 

a40t A II A HS5§S ATGAGAAATGCCATGTCAA ^^ 

TAATGACTCGTACTCTTTACG6TACAGTT6TATGATC7GTT7AACTGGCGTAAAAA6CCC 

)f?i";tProAraGlyThr7hrProArQAl4ProV*lLYSllePro7hrA 
5 61 5II?I G " AAGAGG i A £CACACCAAGS^ 

CAATACGGTTCTCCATGGTGTGGTTCCCGAGGCCACTTCTAAGGTTGGCGTAACGATTTT 

5473 kpnl, 

eo-i yii A £9 A £9 G iybsyS luThrGl * Tr e A ^^ 

CACTCC7CCCCTGACC777GGCC7ACCCGGA7GTG7GT6G77CC6CCGTAT7CGA67CA7 

A 5Et} i S^*l ThrAl * Gl ^ L r sA »e L « u, -e« v *lCy«AspS«rH«t61yArg7hrArgV»l 
5581 GACCATGTGACCGCAGGCAAAGACCTAC7GGT7TG7GATAGTA7GGG7AGSACAAGAGTG 
CTGGTACACTG6CGT CCGTTTCTGGATGACCAAACACTATCATACCCATCCTGTTCTCAC 

ValCvsGlnS«rAsnA*nLYsLeu7hrA»pGlu7hrGlu7YrGlyV*lLymThrA«pSer 
56M § J7TGCCAAAG7 AACAACAAG7T AAC7GATGAGACAGAA7A7GG7GTCAAGACGGAC7CC 
CAAACGG77TCAT7G77G7TCAAT76AC7ACTCTGTC77A7ACCACAGT7C76CC7GAGG 



341 



5701 



5761 



5661 hpal, 

GlyCysPraAspGl yAl aArgCysTyrVa 1 LeuAsnProG luA 1 aVa 1 Asnl 1 eScrGl y 
5S A l2lS£ AGA T GG TGCCA6gT6CTACGTATTAAATCCAGAGGCAGTAAATATATCAGG6 
CCTACA6GTCTACCACGGTCCACGATGCATAATTTAGGTCTCC67CA777A7A7AGTCCC 

SerUvsGlvAlaAlaValHisLeuGlnLysThrGLyGlyGluPheThrCysValThrAla 
TCCAAGGGAGCTGCTGTACACCTCCAAAAAACAGGTGGG6AAT7CACATGTGTTACT6CA 
AGGTTCCCTCGACGACATGTGGAGGTTTTTTGTCCACCCCTTA^GTGTACACAATGACGT 

5800 ccorl , 

.-w SerGl YThrProAlaPhePheA»pL*uLy«A»nL«uLysGlyTrpS*rGlyl.«uProIl« 
5821 T CGGGAAC 7 C C AG CCTTCTTTGACCTGAAAAATTT6AAGG6AT6GTCAGGT CT ACCCATA 
AGCCCT7GAGGTCGGAAGAAACTGGACT7TTTAAACTTCCCTACCA6TCCA6ATGG6TAT 

PheGluAl *SerSerGlyArqValValGlyArgValLysValGlyLy«AsnGluGluSer 

5881 TTTGAGGCTTCTAGTGGCAGGGTGGTCGGCAGAGTTAAAGTA6GAAAGAAT6AGGAATCC 
AAACT CCGAAGAT CACCGT C CC ACCAGCCGT CT C AATT T CATCCTTT CTT ACT CCTTAGG 

r . Lv*PpoThpLy«L«uri»tS»rGl vl l»GlnThrValS»rLy»S«rThrAlaA«pL»uThr 
5941 AAGCCCACAAAATTAATGAGTGGtATCCAAACCGTCTCAAAAAGCACAGCCGATTTAACA 
TTCGGGTGTTTT AATT ACT C AC CAT AGGTT TGGC AGAGTTT TT CGTGT CG6CT AAATTGT 
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t igurt 2 



toot 



9 1 IfMSXi^Y*^^* 1 l »Thr5«rnetA«nArqGlyA«pPh«LytGlnll«ThrL»tiAl a 

5*5**55* caXgaUataaccagcatgaacagGggXga^^ 

CTCTACCAGTrCTTCTATTGGTCGTACTTGTCCCCTCTGAAATTCGTcU 

€06t A CAGGGGCAGGAAAAACTACAGAACTCCCAAAGGCAGTGATAGAGGAGATA6^ 
TGTCCCCGTCCTTTTTGATGTCTT6AGGGTT TCCGTCACTA^ 

bl2l $iGCGGGTGCTAGT6CTTATACCATTGA6AGCA6CA6CTGAGTCA6tCTATC^^ 
"CGCCCACGATCACGAATATGGTAACTCTCGTW^ 

61SS pvutl, 61S6 tthlllt, 
4210 ftitll, 

TACCGTTGACCCTAGTGGATGC5GA6TATACCTATAAAAAC66TTTACG6CGTCGSCTTC 
6266 nd«l f 

t;4^J?ii*iif l, ?5tV*IGluTYrS«rTyrIl»Ph«L»oA«p€luTyrHi«Cy«Al«Thr 

GGACTCGTCAACC6ACAGTATCCTTTTTAA6T6TCTAAAAGAtTTtC«TATT££CACCAA 
CGATACT66CG6T6G6GTC6TCCCAGTCATTGATGTTGTCCC6TTTTT6T666TTATCTT 



6461 



6721 




6541 GCGGG 



6461 aeorl, 6493 Mtll, 
Al«Glyl«ut 



CGCCCCGATTTTTAGGGCCAACTCCTCTACTTCCCATTGTACGACCA6AAGCAT666TGT 
. M A £9 A * "?f iMf X!ii*B v * 1 A 1 *L jr»ty«L tulysA I *Ly*G 1 yTyrAifiScrGI yTyr 

6601 ^gaaacatggcagttgatgtagccaagaaactaaaagcc^ 

tctttgtaccgtcaactacatccgttctttgattttcggttcccgatgttgagtcccata 

atgat$tcacccctt€t6ggccgattgaactccca€cattgta6t6t€ag6g6tatgca6 

SIC5I5SS5iS£4i T6CCATT6A6TCA€G * €TAAC6CT6c CAGATTTAGAtACAGTTGTT 
CATCATCGGTGGTTACGGTAACTCAGTCCCCATTGCGACGGTCTAAATCTATGTtAACAA 

* 7M S a S a £ a S5»ctgaagt6tgaaaAgagG6T6ag8gtgtcatcaaaaatacctttcatagta 

CTGT6TCCAGACTTCACACTTTTCTCCCACTCCCACAGTA«TTTTTAT6*AAAGTATCAT 
ilA , I5r|ivLguLy«Aron«tAt«VAlThrV*lGlvGlueinA 

6641 ACAGGeCTTAAAAGAATGGCTGTCACTGT66GtGAACA€6CTCAGC6AA6A€GeA66GTA 
TGTCCGGAATTTTCTTACC6ACAGT6ACACCCGCTTGTCCGACTCGCTTCTCC6TCCCAT 

6043 ttut, 

ADfW GlyAray*lUyjPro61yArQTyrTyrAroS«rGlnGluThrA 

6901 6§T AGAGTGAaGCCCGGT AGStaCTATA6aAGCCAG6AAACA6CGaCCC€GTCAAAGGAC 
CCATCTCACTTCGG6CCATCCATGATATCTTCGGTCCTTT6TCGCTGGCCCAGTTTCCTG 

494S tthlllt, 

tnt * Ir^"i«Iy«'A*pU«uL»uGlnAI*Hl«AraTyrGWll«GluAipGlyll»AmnV#IThr 
6961 TACCACTXTGACCT6TTACAGGCACACAGGTATGGGATAGAAGATGGAATCAACGT6ACA 
ATGGTGATACTGGACAATGTCCGTGTGTCCATACCCTATCTTCTACCTTAGTTGCACTGT 

6973 tthllll, 7017 tthlllt. 
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T GTC6TTCATCACCTCTGACCCTTTC6T6ACTTCGTTCACCATCCT 



Figure i 

ta21 !^?7C CTTTAg2gaAATGa2t^ * 
r 2 l TTtAMAAATCrrTTlIrT?^^ 

T TCAtfoAAAT CCCTTT ACTT AATGCTAACCTC6GACATGCTCCTCCTGTCGAACGACT AT 

^ TGGGTC6ACCTCT ATGACTTGTTAGAT6AGTAGAGACTTCT66ATGGTCGTCGTCATTTT 

7064 pvult, 

M2li??Tr£itt??ItS5Sf8"i!C ro6luProll «G»nt»oAUTvrAtn5»rTyrGlu 

» TGTA6TACCGTTCCTGACTAGTGGGTCTTG6TTAGGTC6AACGTATGTTGTCAATACTC 
71S6 belt, 

* caggtccagSgacatgacaaaggtttttattccttaccc?™^^ 

* TTGATGAGTAAGGATTTACGTTCTTTTGATCCC^TCTACATG^ 

* TGGCTTCTACTTCTGGACCGTCATCTGGAAGATCCGAACCTGACCU 

L«uS«r 8»r A I * G 1 u 

. w« . «m*«CT6TCCTCTCCTGA6 
TCGTT CACCAT CCT6ACAG6AGACGACT C 

7440 bsal, 

A A«nA I «L«ut »u I 1 • A I *L«uPh«G 1 yTyrV* 1 6 1 y Ty rG 1 nAl «L»u8«rLy«AraMl • 

TTAC6G6AC6AGTATCGGGACAAACCCATACATCCTATAGTTCGAAACAGTTTTTCTGTG 
7475 VCOrS, 74*1 hindlll, 

$501 gtcccaatgatcacagacatatXcaccatagaagatcaaagactagaggacacaacccac 

CA666TTACTAGTGTCTGTATATGTGGTATCTTCTAGTTTCTGATCTCCT6TGTTGG6TG 
750ft belt, 

tS^§i?lTEi l * ProA,nAI * ll#Ap 9 ThrCiuC lX*-r»6 | « T, »«'«l*»'-*»»ty««lyLpu 

7561 ctccaatatgcacctaatgctataagaactgaggggaaggagactgaactaaXggaatta 
gaggttatacgtggattacgatattcttgactccccttcctctgacttgatttccttaat 

AI«V«lGlyA V pn*tABpAratl»n«t6luS*rll«S«rA«pTyrAU8«rGlyGlyL«u 
7621 GCAGTGGGTGACATG6ACAGAATCAT6GAATCCATCTCAGATTATGCATCAGGAGGGTTG 
CGTCACCCACTGTACCTGTCTTAGTACCTTAGGTA6A6TCTAATACGTAGTCCTCCCAAC 

7664 *y*3, 

ThrPhpX1«AroS«r6InAUGIuLypValAra6prAl«ProAliPh»LY«GluAsnVal 

/am acattcataagatctcaggcagagaXagtaagatctgcccctgcattcaaagaaaacgtg 
tgtaagtattctagagtcc6tctctttcattctagacg6ggacgtaa6tttctttt6cac 

7690 bajlt, 7711 bglll, 

GluAUAULy«GlyTyrVatGlnLy*Ph«Il«AspAlaLvull«GluAsAtyftGtuThr 
7741 GAAGCTGCAAAAGGGTACGTCCAAAAGTTTATT6ATGCTCTTATTGAAAACAAAGAAACC 
CTTCGACGTTTTCCCATGCAGGTTTTCAAATAACTACGAGAATAACTTTTGTTTCTTTGG 

ll«ll«AroTyrGlyL«uTrpGlyThrHlsThrAlaLsuTyri.y«S«rll«At«AlAAra 
7801 ATAATCAGATATGGCTTATGGGGAACACACACGGCACTTTACAAFA6TATT6CCGCAAGA 
T ATT AGTCTATACCGAAT ACCCCTTGT6TGTGCCGTGAAATGTTCTCATAACGGC6TTCT 

L.uGlyHi«61uThrAUPh«Al«ThrU»uV*in»Ly»TrpL«uM#Ph.GlvGlyGlu 

?661 CTGGGGCATGAAACAGCATT76CTACGCTAGTGATAAAGTG6CTAGCCTTCGGGG6T6AG 
GACCCCGTACTTTGTCGTAAACGATGCGATCACTATTTCACCGATCGGAA6CCCCCACTC 

7902 nhttl, 

PraV*lS«rA«pHisV*l AroG) nAt*ThrV*lA«pL«'uV« lV*lTyrTyrValH«tA«n 
7921 CCGGTGT CA6ATCATGTGAg'caGGCGACCGTTGACC 

GGCCACAGTCTAGTACACTCTGTCCGCTGGCAACTG6ACCAGCAAATAATACACTACTTG 

tOQL f » K I f f 4 
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Fiqure 2 

LysProSerPh»ProGt y AspS«r6 1 uThrG 1 nGl nGluG 1 y Acq Arq*Ph«v* I A 1 aSer 
796 1 AAACCCT CTT T CCCAGGGGAT T CCGAAAC CCAGCAGGAGGG&AGbC GAT T CG T TGCCAGC 
T T TGG6AGAAAGGGTCCCC TAAGGCTTTGGGTCGTCCTCCCCT CCGCT AAGCAACG6TCG 

6041 TTATTCA7 CT C CGCT CT GGCA ACCT ACACAT AC AAGA C T TGGAAC T AC C ACAACCTCT CC 
AAT AAGT AGAGGCGAGACCGTTGGATGTGTATGT TCTGAACCTTGATGGTGT TGGA6AGG 



6101 



LvsVa l VftlGluProAl ALeuAl *T yrLeuProTy r A 1 *ThrSerA I aLtuLy ftHetPhe 

aAggtagtagaaccagctttggcatXcctcccctAcgctaccagtgcactgaXaatgttc 
tt ccat cat cttggt cg aa ac cgta tggagggg at gcga tggt ca cgt gact t t t acaag 



61S1 K»nt 



ThrProThrArgLeuG luSerGluVall 1 «LcuS«rThrThr I 1 eTyrLy«ThrTyrL«u 
6161 ACCCC A ACT AGACTGGAGAGCGA6GTT AT ACT T AGCACT ACAAT AT ACAAAACTT ACCT C 
TGGGGTTGATCTGACCT CT CGCTCCAATAT6AATCGTGATGTTAT AT6TTTTGAATGGAG 

Serl l«AraLysGlyLy«ScrAspGlyt«uL«uGlyThrGlyll*S«rAlaAl«n«tGlu 
6221 TCAATAAG6aaGGGGAAAAGTGATGGACTCTT6GGTACAGGGATTAGTGCGGCAATGGAA 
AGTTATTCCTT CCCCTTTT CACTACCTGAGAACCCATGT CCCT AAT CACGCC6TTACCTT 

1 IcLeuScrGl nA«nProVa 1 S*rV« 1G1 y I leSerVa 1 He tLeuG 1 yVa IGlyAl *I l« 
6261 AT T CTGT CAC AGAACCCGGT AT CGGTAGGCATAT CT GT T ATGCTGGG&GTGGGbGCAATT 
T AAGACAGTGT CT TGGGCCATAGCCAT CCGT AT AGACAATACGACCCCCACCCCCGTTAA 

6264 tthllll, 

Al aAlaHisAsnAl *U lcGluS*rS«rGluG)nLysArqThrL»uL«uf1»tLy*ValPh« 
6241 GCCGCTCACAATGCCATTGAGTCTAGC6AACAAAAAAGGACCCTGTTGATGAAAGT6TTC 
CGGC6AGTGTT AC6GT AA"CTCA6ATCGCTTGTTTTTTCCTG6GACAACTACTTTCACAAG 

6391 n*nl, 

Va lLysAs nPh«T rpSerG I nA I aA 1 aThrAipG luLvuValLysG luAsnProG luLy* 
A4Q1 GTAAAAAACTTCTGGAGCCAGGCAGCAACAGAtGAATTGGTGAAGGAAAAT CCA6AAAAA 
° CATTT TTT GAAGACCTCGGT CCGT CGT TGT CT ACT TAACCACTTCCTTTTA6GTCTTTTT 

I lelleHttAl aL«uPh«GluAlaValGlnThrl leGlyA*nProL»uArgL«ull«Tyr 
IA ,. ATAATAATGGCCCTATTTGAAGCAGTTCAGACAATTGGTAACCCTCTGAGGCTTATATAT 
T ATT ATTACC66GAT AAACTTCGTCAAGTCTGTT AACCATT6GGAGACTCCGAATATATA 

6479 K«nl, 6497 bstE2, 

Hi*LeuTyrGlyValTyrTyrLysGlyTrpGluAl*Ly»61uL»uS«r61uArqThrAU 
.» 4 CACCTGTATGGAGTTTACTACA^AGGCTGGGAAGCAAAAGAACTATCCGAGAGGACAGCA 
" a1 GTGGACAT ACCTCAAATGAT6TTTCCGACCCTTCGTTTTCTTGATAG6CTCTCCTGTC6T 

GlyArgAsnL«uPheThrL»uIlenctPheGluAl*PheGluLeuL«uGlyn«tAspS«r 
m*m* GGtAGGAACCTGTTCACTTTGATAATGTTCGAAGCTTTCGAACTGTTAGGGATGGACTCT 
6561 C CGTCCTTGGA CAAGTGAAACT ATT ACAAGCTTCGAAA6CTTGACAAT CCCT ACCTGA6A 

6S66 Mini, 6612 hindlll, 

GluGl ytysl 1 •ArqAsnLsuSerGI yAsnTyrl lcLeuAspteuI leTyrScrLvuHis 
GAAGGGAAGATAAG&AACCTGTCTGGAAATTATATCTTG6ATTTGATCTATAGTTTACAT 
6641 CTT CCCTTCT ATTCCTTGGACAGACCTTTAATAT AGAACCT AAACT AGATAT CAAATGTA 

LysGlnl 1 eAsnArgS«cL«uLy*LysVa 1 Va lLeuG 1 yTrpAlaProAl aProPheScr 

M AAACAGATAAACA6AAGCTTGAA6AAA6TG6TCCTGGG6TGGGCTCCCGCACCTTTTAGT 

6701 TTTGT CTATT TGTCTTCGAACTTCTTTCACCAGGACCCCACCC6AGG6CGTGGAAAATCA 

671S hind111 v 

* 

Cy*AspTrpThrProSerA*pGluArgIl»ArgLeuProThrA»pAanTyrL«uArgVal 
TGTGACTGGACTCCTAGTGATGAGAGAATTAGGTTACCCACAGACAACTATCTAAGAGT6 
6761 ACACTGACCTGAGGATCACTACTCTCTTAATCCAATGG6TGTCTGTT6ATAGATTCTCAC 

6792 bstE2, 

GluThrLv*CysProCy«GlyTyr61uM»tLysAl*L«uArgA*nV* IScrG lySerLeu 
GAGACTAA6TGCCCATGTG6TTATGAGATGAAAGCACTAAG&AAC6TTAGTGGCAGTCTT 
6621 CTCTGATTCACG6GTACACCAAT ACTCTACTTTCGTGATTCCTT6CAAT CACC6TCAGAA 

Thrl l«V*lGluGluLy*GWProPh«L«uCy»ArdA«nArgProGl yArgGlvProVal 

_._ 4 actatagtggaaga6aaagg6ccttttctct6tagZaacag8cctg^ 

a * ai tgatatcaccttctctttcccggaaaagagacatccttgtccggaccatctcccggtcaa 

6920 stul v 6936 hpal , 

A*nTyrArgValThrLy*TyrTyrA*pAipA»nL;uAt*GluIl»Lv«Pr^ 

AACT AT AGAGT T AC A A AAT ACT ATGA I GACAACCT CGCAGAGAT AAAGCCAGTTCGAaGA 

ttgatatctcaatgttttatgatactactgttggagcgtctctatttcggtcaagcttct 
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9001 



9061 



figure 2 



G I yLy % 
GGCAaG 
CGTTC 



LtuGluu I yLtuv i !G1 uHi «TyrTyrly«Gl y V* I Thr Al a Aral l»A»nTvrG 

ct agaaggact cg t ggagc act att acaaaggt gtcacagcaa6^ 

GAT CTTCCT6AGCACCT CGTGATAAl GTTT CCACAGTGT CGTTCCTAT CT AAT AC 

GlyLy«H«tLf uLvuAl AThrAcpLysTrpGluValGluHitGl vl l«ValThrArnL0u 
S$ AAAAA ™£ISI TA S£^ 

CCTTTTTACGACAATCGGTGACTATTTACCCTCCACCTCGTGCCATA6CATTGATCCAAC 



9121 



9161 



9241 



9301 



9361 



9421 



9481 



95M 



9601 



9661 



A t «Ly«Ly*TyrThrGl yV* 1GI yPhvLyt GlyAlaTyrL«uGl vAvDGluProA«nHi« 

CGCTTCTTCAT6TGACCACAACCCAAGTTCCCTCGTATGGACCCACTGCTCGGGTTG6TG 

AroAspL*uValGluAroA«pCy»Al AThrIl»Thrty«A»nThrValGInPh«LauLvm 
SSISi£JlaSISS* A 5 € * CA "* T * M ^ 

GCACTG6ATCACCTTTCTCT6ACACGTT66TATTG6TTTTTATGTCAAGTCAAAAACTTT 

SIf£ A S A i A S«I$ T SS ATTT ^^ 

TACTTCTTTCCGACACGTAAATGGATACTGAACAGGGACAG6TTAAACT6GTCCAATTAA 
G luL*uV*lHisLyftA»nAsnL«uGluGluUy»A«ol laProAl AAlflThrLeuThrThr 

CTTAACCACGTGTTTTTATTGGAACTTCTCTTTCTGTATG6TC66CGGTGTAATTGTTGT 

CvmL«uAI«TyrThrPh«VAl AsnGluA«pll«61yThrll«Cy«ProV*lLtuGlyGlu 
I5CCTA6CTTACACATTTGTSAAT6AAGAtATC666ACTATAAAACCA6^ 

ACGGATCGAATGTGTAAACACTTACTTCTATAGCCCTGATATTTTGGTCATGACCCCCTC 
9386 acorS, 9408 «c*l , 

A S:9^*I lleAl * Af e ProV * lv * IA «eI l» A »^«u61nProGluV»lGlnV*lAtpThr 

AgAGTGATAGCCGAtCCAGTGGTAGACATTAACTTACAACCAGAAGTGCAGGTGGA^ 

TCTCACTATC6GCTGGGTCACCATCT6TAATTGAATGTT6GTCTTCACGTCCACCTAT6T 

S«rGluyalG!yll«ThrLcuV«161yArQAUAlAL«ijn«iThrThrGlyll»ThrPro 
TCAGA6GTT66GATCACTCT66TTGGAA6AGCAGCCTTGAT6ACAACA66TATTACACCC 
A6TCTCCAACCCTA6TGA6ACCAACCTTCTCGTC66AACTACTGTTGTCCATAATGTGGG 

X5iXfl§! uL t» T t p f l SC r 2 A * ftAl » A »B 6l r s *E' ro8 * r S»rll»Ly«Il»6lyL«u 

SIKIISW45^£ftSiS££ T M TCC S CA ^ C6tA6TCCAAftcTCTATAAA ^ T T6GXCTG 

CACCAACTTtTTTGTCT CGGATTACGGCTACCGTCAGGTTCGAGATATTTCTAACCTGAC 

5 A l5ti§€AT6TTACCCAGG6CCTA6ACCGCAA6AtCACACTTTAGCT6AC6AAATACAT 
CTGCTTCCTACAAT6GGTCCCGGATCT6GCGTTCTG6TGTGAAATCGACTGCTTTATGTA 

§5E A !:5li i e ClaAr 9 ProPh « v * lLeuV * l *-^ Gl y s »«' Ap «8«rS»rH»tS«rAmnArq 
TCTAGGGATGAAA6GCCCTTTGTTTTGGTCTTGGGTTCAA6AAGTTCCATGTCAAATAGA 
A6ATCCCTACTTTCC6GGAAACAAAACCA6AACCCAA6TTCTTCAAG6TACA6TTTATCT 



Al«LytThpAl*ArqA«nI 1 «A«nCy«ThrG 1 nLy«AroProGln6lul lvAraAspLau 
9721 GCAAAAACTGCTAGAAACATCAACTGTACACAGAAAAGACCCCAGGAAATTAGAGATCTG 
C6TTTTTGAC6ATCTTTGT A6TTGACATGTGTCTTTTCTGGGGTCCTTTAATCTCTAGAC 

9774 bQlll, 

M»tAl*GlnGlyArgf1etU«uV«lV») Al*LtuApqS«rPh«A»nPro61uL»uS»rGlu 
9781 ATGGCACAAG6GCGTATGCTAGTAGTG6CTTTAAGAAGTTTCAATCCTGAGTTGTGTGAA 
TACCGTGTTCCCGCATACGATCATCACCGAAATTCTTCAAAGTTAGGACTCAACAGACTT 

9*40 spal, 

L«uV«lA«pPh*LY»GlyThrPhvt«uAftpAr9VAlAl#U»u6luAl«U«uS«rL«uGly 
9641 CTAGTTGATTTCAAGGGGACTTTCTTGGATAGGGTTGCCTTG6AAGCCCTTA6CCT6GGG 
GATCAACT AAA6TTCCCCTGAAAGAACCT ATCCCAAC66AACCTTCGGGAATCGGACCCC 

9900 bgll, 

ProGlyAraProLyt61nVAlThrThrAlaThrV*lLyttGluL»uL»uGlu61n61uGlu 
9901 CCG6GAAGGCCCAAGCAGGTAACCACAGCCACAGTTAAGGAGTTGCTAGAGCAAGAG6AA 
6GCCCTTCCGG6TTCGTCCATTGGTGTCGGTGTCAATTCCTCAACGATCTC6TTCTCCTT 

9918 b«t£2. 



GlnV«lGluXl«ProAftnTrpPh*GlyAlAAspA«pProValPh»L«uGluVAlAl*t«u 
9961 CAAGTCGAGATCCCCAACTG6TTCGGT6CGGATGACCCAGTCTTCTTGGAAGTAGCTCT6 
GTTCAGCTCTAGGGGTT6ACCAAGCCACGCCTACTGGGTCAGAAGAACCTTCATCGAGAC 

9994 tthllli. 
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10021 



10031 



LytGIyAspLytl yrMl sUuVa I Gl yAspV/a 1 A«pLy*Va ILyftAspG 1 nAl aLytGl y 
AAGGGTGAC AAA T ACCACT T AGT AGGTGAiGT AGAi AAAGT AAAAGAT CAAGCAAAGGGA 
TTCCCACTGTT I ATGGTGAAT C AT CCACT ACAT CT A TT T CATTTT CT AGTTCGT T TCCCT 

LeuGlyA I aThrAspGl nThrArgl 1 tV* I L y * G 1 u Va 1 G 1 y A I a Ar g Thr T y r Th rh« t 
CT AGG&GC C ACGGACC AAACT AGAAT AGT AAAAGAAGT AGGiGCGAGAACC TACACAATG 
GATCCCCGGTGCCTGGTT TGATCTTAT CATT T T CTT C AT CCACGCT CT T GGATGT6TT AC 



10141 



10201 



10261 



10321 



10361 



10441 



10501 



10561 



10621 



LysLeuSerSerTrpPheLeuGl nAl aSerS«rLy«G I nM«tS»rL»uThrProL»uPh« 

aagctgtctagtiggtttcttcaagcatcaagtaaacagatgagcttgacccctttgttc 
ttcgacagat caac caaagaagtt cgt agtt c atttgt ctactcgaactg6g6aaacaa6 

G 1 uG 1 uLvuLauLeuArgCysProProLysri* tLysAftnAsnLy«61yHl«l l«Gly8«r 
GAGGAACTGTTGCTTCGT TGCCCTCCCAAGATGAAGAACAATAAA666CATATCG6ATCA 
CTCCTTGACAACGAAGCAACGG6AGGGTTCT ACTTCTTGTTATTTCCCGTATAGCCTA6T 

AlATyrGlnLeuA)aGln61yAftnTrpGluProL«uAspCy«GlyVAlHlsL«u6IyThr 
GCCTACCAACTAGCTCAGGGCAACTGGGAACCCCTCGATTGTGGAGTACACCTGGGCACC 
CGGATGGTTGAT C6A6TCCCGTTGACCCTTGG6GAGCTAACACCTCAT6TGGACCCGTGG 

I lcPraAl aArgArgValtysl IcHisProTyrGtuA 1 aTyrL«uLy«L«uLy«A«pL»u 
ATACCTGCC AGbAGGGTAAAGATCCACCCAT ATGAGGCCTATCT6AAACTGAA66ATTTA 
T ATGGACGGT CCTCCCATTTCTAGGTGGGTAT ACTCC66ATAGACTTTGACTTCCT AAAT 

10349 ndcl, 1Q3SS stul f 

Lau61uG1uGl uG 1 uArqLy t ProGl u6 1 yArgAcpThrVa 1 X i tArgGluHl sAsnLyi 
TTAGAAGAAGAAGAGAG5AA6CCAGAG6GTA6AGAtACAGT6ATAAGA6AACATAACAA6 
AATCTTCTTCTTCTCTCCTTCGGTCTCCCATCTCTAT6TCACTATTCTCTTGTATT6TTC 

Trpl 1 eLeuLysLysV* lArgProProArgLysProGlnTyrLytGluAsnProGlnPro 
TGGATCCTCAAAAAAGTGAGfiCCACCAAGfiAAACCTCAATACAAAGAAAATCCTCAACCC 
ACCT A6GAGT TTTTT CACTCCG6T66TTCCTTT6GA6TTAT6TTTCTTTTA6GA6TTGGG 

10442 banhl, 

TrpLysAlal 1 eArgA ) aThrArgLeuG luLysGl yl leLytGluThrSvrl 1*1 l»Thr 
TGGAAA6CT ATCAGAGCAACT AGACTA6AGAAGG6CATAAAAGAAACATCTATAATAACC 
ACCTTTC6ATA6TCTC6TT6ATCTGATCTCTTCCC6TATTTTCTTT6TA6ATATTATTGG 

LysLeuAlaSvrl leLeuThrGlyAlaGlyll«ArgL»u61uLy*L«uProValValArg 
AAATTGGCCTCCATACTAACA66T6CA6GAATAA66CT66AAAAATTGCCA6TC6TTAGA 
TTTAACCGGAGGTAT6ATT6TCCAC6TCCTTATTCC6ACCTTTTTAAC66TCAGCAATCT 

AlaGlnThrAspHi«Ly»S«rPh«Hl»GluAlaI l«ArgAftpLysIl«AspLysAmn61u 
6CCCAAACTGACCATAAAAGTTTCCATGA6GCAATCA6AGATAAGATA6ACAAGAAC6AA 
CGGGTTTGACT66TATTTTCAAA6GTACTCCGTTAGTCTCTATTCTATCT6TTCTTGCTT 



10661 



10741 



10601 



10861 



10921 



AsnGlnGlnSerProGlyLcuHiiAspLy«LcuL«uGlull«Ph«Hl«Thrll«Al«61n 
AAT CAGCAGAGCCCAGGATTACAT6ATAAATTGTTAGA6ATCTTTCACACAATAGCCCAA 
TTAGTCGTCTCG6GTCCTAATGTACTATTTAACAATCTCTA6AAA6T6TGTTATCGGGTT 

10718 bglll, 

ProSerLeuLysHisThrTyrGlyGluVa 1 ThrTrp61u61nL«u6luAlaGlyll»A*n 
CC CAGCCT AAAGCACACTT ACGGC6AAGTGACGTGbGAACA6CTTGAG6CAG6GATCAAC 
GG6TCGGATTTCGT6TGAATGCCGCTTCACT6CACCCTTGTC6AACTCCGTCCCTA6TTG 

ArgLysGl yAlaAla6lyPheLeuGluLysLy«AinLeu6ly61uValt»uA«pSer61u 
AGAAAAGGGGCTGCAG6CTTTCTAGAAAAGAA6AATCTTG6A6AA6TACT66ACTCAGAG 
TCTTTTCCCCGACGTCCGAAAGATCTTTTCTTCTTAGAACCTCTTCAT6ACCTGA6TCTC 

10811 pstl, 10621 xbal, 10645 seal, 

LysHisLeuVa lAspGlnLeul IvArgAspLeulytThrGl yArgLysl UArgTyrTyr 
AAGCACCTGGTGGACCAACTAATCA6AGALCTGAAAACAGGACGGAAGATAAGATATT AT 
TTCGT6GACCACCT66TTGATTA6TCTCT66ACTTTT6TCCT6CCTTCTATTCTATAATA 

GluThrAlall eProLy«A*nGluLyi Arg Ai pVa 1 S«rA«pAmpTrp6 InA 1 aGl yA»p 
GAGACAGCAATACCTAA6AACGAGAAGAGGGAT6TCAGTGACGATTGGCAA€CAGG6gaE 
CTCTGTCGTTAT6GATTCTTGCTCTTCTCCCTACAGTCACT6CTAACCGTTC6TCCCCT6 



10961 



11041 



I IcVal AspGluLysLysProArgVal 1 1 »G1 nTyrProGluAl«Uy«ThrArgL»uAl« 
AT AGT TGATGAAAAGAAACCAAGAGTGATT CAATACCCT6AA6CTAAGACAA6ACT66CC 
TATCAACTACTTTTCTTT66TTCTCACTAA6TTATGGGACTTC6ATTCT6TTCT6ACCG6 

11036 ball, 

1 l»ThrLysValM*tTyrA*nTrpVaIty»Gln61nProVaiyalll«ProG!yTyrGlu 
AT CACTAAAGTTATGT ACAACTGGGT6AAGCAGCAGCCT6TTGTGATCCCA66GTAT6AA 
T AGT G ATT TC AAT ACATGTTGAC CCACT T CGT C6TCGGACAACACTAG66TCCCATACTT 
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Figure 2 

Glyty*ThrProL»uPh«Ly* 1 l«Pn»A«nly«V* \ ArgtyiG 1 uT rpA*pL»uPh»A«n 
lllCft G6&AAG A CCCCAT TATTCAAGATCTT T AACAAGGT AAgXaAGGAATGGGACCTGTTC AA? 
CCCTT CTGGGGTAAT AA6T T CTAGAAATT6TTCCAT 7 CTTT CCTT A^CCTGGACA^ 

11120 bglH, 

6lu ?£°i!ii£ii v * l S*rPh»A*pThrLy*Al*TrpA«pThrGlnValThrS»rArflA«o 
11161 GAQCC AGT AGCTGTG AGTTT T GATACTAAGGCCTGbGAtACCCAAGTCACTAGTAG^CA? 
CTCGGTCATCGACACT CAAAACT ATGATT CC6GACCCT6T6G6TT CAGTGA^ 

11189 *tul, 11209 *p«1, 

L«uApqL«u 1 1 »GlyG lul 1 *G I nLy tT yrTyr T y r Ar gLy ftG 1 uTr pMl »Ly«Ph»l 1 • 
11221 " A «8CT7ATTGGt^ 

GATGCCGAATAACCACTTTAAGTTTTTATAATGAT6TCCTTCCTCACCGTGTTTAAGTAG 
11246 sspl, 1127ft CU1, 

AcpThr 1 IcThrAtpHi tHetVal GluVi 1 ProVtl 1 ItThrAI «A»dG1 vGIuVa lTvr 
1l2ai 6AfACCATCACCGA|cACATGGT€GAGGTACCAGTCATAA^^ 

CTATG6TA€TG6CTGGTGTACCACCTCCAT6GTCAGTATT6TCGTCTACCACTTCATAT6 

11307 kpfit. 

X l«AraA«nGl yGl AArgGl ySvrGl yGlnProAspThrStrAl iGl vA«nS»rH»tL»u 
11341 £TAAGAAATGGAC AAAGbGG < AGTGGCCAGCCAGA^ 

TATTCTTTACCTGTTTCCCCATCACCG6TC6GTCT6TSTTC6CGTCCATTGTCGTACGAT 

11364 b*l1 t 11393 »phl, 

AtnV*lL«uThrri»tn«tTyrAlAPh«Cyt61uS*rThrGlyValProTyrLYtG»rPh» 

11401 aatgtgttaacaatgatgtatgccttct&tgaaagtacgSg&gttccat^ 

TTACACAATTGTTACTACATACGGAA6ACACTTTCAT«CC^ 
11406. hpal, 

AsnArgV* 1 A I *Arg I l»Mi «V« 1 Cy*Gl yAspA»pG ly Ph*t«ul 1 »ThrGluAroG 1 v 
11461 AATAgXgT7GCAAG§A1CCAT6TC76T666GA 

TTATCTCAAC6TTCCTAGGTACA6ACACCCCTACT6CC6AA6GACTAT7GTCTCTCCCCC 

11474 b*»h1, 11476 bstXl* 
. « tSSSilCltrtt;? i;Sx»Si"5inArgA«pAUA«nPh»Cy*n«tAroApaAl aSvr&tr 

11521 ctgggcactaaaatttgccaacaaagSgatgcaaacti ct&catgagScgSgcaagctca 
6accc6tgattttaaacggttgtttccctac6tttgaagacgtactcc6ccc6ttcgagt 



11581 



AAAAATAACAGAA6S66AAAGAATGAAAGCT7€CC7A7A6GTTTGAG8CATA6AG7T7TG 
TTTTTATTGTCTTCCCCTTTCTTACTTTCGAACGGATATCCAAACTCCGTATCTCAAAAC 



11607 hlndlll 



UvuProHlftThrSerProArgLyftCyftliuIleMftPraAlaAlaThrTrpProValGly 

ii64i ctcccacacaccagtccccgtaagt&tctgataataccagcagctacatggccggtaggc 

6A6GGTGT6TGGT CAGGGGCATTCACAGACT ATTATGGTCGt C6ATGTACCG6CCAT CCG 

ThrAl«Ilcll«L«uSerLy«H«tAl«A«nLy*Il*61yL»u$«rGlyGluArgGlyThr 
11701 ACTGCCATTATATT AT CAAAGATGGCCAACAAGATTGGATTAAGTGGAGAGAGA6GT ACC 
T6ACGGT AAT ATAAT AGTTTCTACCGGTTGTTCTAACCTAATT CACCTCTCTCTCCATGG 

11723 b*H, 11755 kpnl, 

ThrAl*TyrGluLy«Al*V*l Al*Ph«S«rPh*LeuL*un»tTyrS*rTrpAanProL»u 
11761 ACGGCAT ATGAAAA6GCAGTGGCTTTCAGTTTCTTGTTGAT6TACTCCTGGAATCCACTT 
TGCCG7ATACTTTTCCGTCACCGAAA6TCAAA6AACAACTACATGAGGACCTTAGGTGAA 

11765 nd«l» 

y£lAroAroIl»Cy*L»uLPuV*lU«u8»r61nHi«ProGlaThrAt«ProS«rThr61n 
11821 GTAAGGA6&ATTT&TCT CCTGGTTCTTT CACAGCATCCAGAAACA6CTCCATCAACCCAG 
CATTCCT CCTAAACAGAGGACCAAGAAA6TGTCGTAGGTCTTT6TCGAGGTA6TTGGGTC 



naai 



11941 



ThrS«rTyrTyrTyrtys61yA«pPrall*61yAl *TvrLy«A«pV« 1 1 IvGlylytAgn 
ACCT CTTACTATT AT AAAG6AGACCCAATAG6GGCCTAT AAAGATGTT ATAG6AAAAAAT 
TGGAGAATGAT AATATT TCCT CT6GGT TATCCCC6GATATTTCT ACAAT ATCCTTTTTT A 

LvuScrGluL»uLy»ArgThrGlyPh»GluLy»L*uA L*AsnL*uAsnL*u9*rL*u&*r 
CTGAGT6AACT AAAAAGbACGGGTT TTGAAAAATTGGCT AATCT AAATCTAAGCCTGTCC 
GACT CACTTGATTTTTC CT6C CCAAAACTTTTT AACCGATTA6AT TTAGATTCG6ACAGG 
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12102 ieorS, 12117 b*U f 
12169 *»nl ( 

*»• wimmmmmwmMBm 

122B4 pvull, 
S»rOP 

4,101 tramimxutttt^^ 

42341 KratmKrcrc»xttH^ 

12*52 AAtXl, 12tS7 noQl v 

^2^ht TTAACA6CCCCA 
AATT6TC666ST 
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